Implementation of provably Rawlsian fair ML algorithms for contextual bandits. (Jupyter Notebook; updated May 10, 2017)
Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"
Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)
Code to trade the financial markets using Contextual Bandits
Awesome list of resources on bandit problems
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
Repo for course CSC2558: "Intelligent Adaptive Interventions" project in nonstationary contextual bandits.
Experiment results using MAB algorithms in Yahoo! Front Page Today Module User Click Log dataset
WIP: A library and AWS SDK for non-contextual and contextual multi-armed bandit (MAB) algorithms, covering multiple use cases
Code for our ICDMW 2018 paper: "Contextual Bandit with Adaptive Feature Extraction".
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
Bandit code contributed by Louie Hoang at MSR.
OCaml bindings to vowpal wabbit
The Contextual Meta-Bandit (CMB) selects among models using the context, with online learning framed as a reinforcement learning problem. It can be used for recommender-system ensembles, A/B testing, and other dynamic model-selection problems.
Code of the NeuralBandit paper
Predicting the outcome of League of Legends e-sports matches using contextual bandits (reinforcement learning).
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
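The repositories above share a common core: a contextual bandit policy observes a feature vector, picks an arm, and updates from the observed reward. As a generic illustration (not the API of any library listed here), below is a minimal sketch of disjoint LinUCB, one of the standard contextual bandit algorithms: a per-arm ridge regression with an upper-confidence exploration bonus. All names, parameters, and the toy reward model are hypothetical.

```python
import numpy as np

class LinUCB:
    """Minimal disjoint LinUCB sketch: one ridge-regression model per arm.

    Illustrative only; not tied to any repository listed above."""

    def __init__(self, n_arms, n_features, alpha=1.0):
        self.alpha = alpha  # exploration strength
        # Per-arm sufficient statistics for ridge regression:
        # A[a] is the d x d design matrix (starts at identity = ridge prior),
        # b[a] is the d-dimensional response vector.
        self.A = [np.eye(n_features) for _ in range(n_arms)]
        self.b = [np.zeros(n_features) for _ in range(n_arms)]

    def select(self, x):
        """Pick the arm with the highest upper confidence bound for context x."""
        scores = []
        for A, b in zip(self.A, self.b):
            A_inv = np.linalg.inv(A)
            theta = A_inv @ b  # ridge estimate of the arm's reward weights
            ucb = theta @ x + self.alpha * np.sqrt(x @ A_inv @ x)
            scores.append(ucb)
        return int(np.argmax(scores))

    def update(self, arm, x, reward):
        """Rank-one update of the chosen arm's statistics."""
        self.A[arm] += np.outer(x, x)
        self.b[arm] += reward * x

# Toy usage: two arms with linear rewards, arm 0 pays x[0], arm 1 pays x[1],
# so the optimal policy picks whichever feature is larger.
rng = np.random.default_rng(0)
true_weights = [np.array([1.0, 0.0]), np.array([0.0, 1.0])]
bandit = LinUCB(n_arms=2, n_features=2)
for _ in range(500):
    x = rng.random(2)
    arm = bandit.select(x)
    bandit.update(arm, x, float(true_weights[arm] @ x))
```

After the loop, the policy should prefer arm 0 on contexts dominated by the first feature and arm 1 on contexts dominated by the second; the `alpha` bonus is what forces early exploration of both arms.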