#

contextual-bandits

Here are 28 public repositories matching this topic...

tensorflow / agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

reinforcement-learning tensorflow dqn multi-armed-bandits bandits contextual-bandits rl-algorithms tf-agents

Updated Jul 8, 2024
Python

david-cortes / contextualbandits

Python implementations of contextual bandits algorithms

reinforcement-learning contextual-bandits multiarmed-bandits exploration-exploitation

Updated Jun 18, 2024
Python

st-tech / zr-obp

Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation

research datasets multi-armed-bandits contextual-bandits off-policy-evaluation

Updated Jun 3, 2024
Python

fidelity / mabwiser

[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library

machine-learning recsys multi-armed-bandits contextual-bandits parametric-bandits non-parametric-bandits

Updated Feb 19, 2024
Python

alison-carrera / onn

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

reinforcement-learning neural-network pytorch thompson-sampling reinforcement-learning-algorithms machine-learning-library neural-architecture-search contextual-bandits mab pytorch-implemention multiarmed-bandits pytorch-implementation thompson-algorithm

Updated Dec 11, 2019
Python

alison-carrera / mabalgs

👤 Multi-Armed Bandit Algorithms Library (MAB) 👮

arm algorithm reinforcement-learning simulation monte-carlo rank thompson-sampling reinforcement-learning-algorithms ucb reward multi-armed-bandit montecarlo-simulation contextual-bandits ranking-algorithm mab ranked-mab

Updated Sep 6, 2022
Python

banditml / banditml

A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

reinforcement-learning pytorch personalization neural-networks bandits contextual-bandits

Updated Jun 4, 2021
Python

instadeepai / catx

🐈‍⬛ Contextual bandits library for continuous action trees with smoothing in JAX

python deep-learning contextual-bandits jax

Updated Oct 7, 2022
Python

lil-lab / blocks

Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)

machine-learning natural-language-processing reinforcement-learning natural-language-understanding contextual-bandits

Updated Feb 7, 2019
Python

pemami4911 / sinkhorn-policy-gradient.pytorch

Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

reinforcement-learning deep-learning combinatorial-optimization permutation-algorithms contextual-bandits

Updated Aug 27, 2018
Python

improve-ai / python-ranker

Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions

python machine-learning reinforcement-learning ai personalization xgboost ab-testing recommender-system multi-armed-bandit multivariate-testing contextual-bandits improve-ai

Updated Jun 9, 2023
Python

hitl-ab-bpm

aaronkurz / hitl-ab-bpm

Business Process Improvement with Reinforcement Learning and Human-in-the-Loop.

reinforcement-learning bpmn webapp ab-testing bpm ab-tests business-process contextual-bandits bpi

Updated May 2, 2023
Python

zaid-g / ccb_tutorial

Contextual multi-armed bandit recommender system using Vowpal Wabbit

tutorial reinforcement-learning recommender recommender-system contextual-bandits

Updated Apr 10, 2022
Python

improve-ai / tracker-trainer

Contextual Multi-Armed Bandit Reward Tracker & Model Trainer

python aws machine-learning reinforcement-learning ai aws-lambda serverless ml personalization xgboost serverless-framework parquet ab-testing recommender-system decision-trees multi-armed-bandit contextual-bandits improve-ai

Updated May 21, 2024
Python

ngutowski / algossim

This repository aims at learning most popular MAB and CMAB algorithms and watch how they run. It is interesting for those wishing to start learning these topics.

recommendation-system artificial-intelligence-algorithms contextual-bandits bandit-algorithms

Updated Dec 7, 2021
Python

doerlbh / dilemmaRL

Code for our PRICAI 2022 paper: "Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior".

machine-learning reinforcement-learning game-theory multiplayer-game behavioral-cloning multiagent-systems human-behavior bandits contextual-bandits prisoner-dilemma

Updated Aug 27, 2022
Python

radsn23 / bandits-codes

Bandits codes contributed by Louie Hoang at MSR.

reinforcement-learning-algorithms contextual-bandits bandits-codes

Updated Oct 26, 2020
Python

TheAmazingElys / NeuralBandit

Code of the NeuralBandit paper

reinforcement-learning neural-networks contextual-bandits

Updated Mar 12, 2021
Python

Murtazali05 / LinUCB

LinUCB with disjoint linear models

contextual-bandits linucb

Updated Apr 14, 2021
Python

pm3310 / pulpo

WIP: A library and AWS sdk for non-contextual and contextual Multi-Armed-Bandit (MAB) algorithms for multiple use cases

aws reinforcement-learning multi-armed-bandits contextual-bandits sagemaker

Updated Apr 4, 2020
Python

Improve this page

Add a description, image, and links to the contextual-bandits topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the contextual-bandits topic, visit your repo's landing page and select "manage topics."