TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation
[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library
An easy-to-use reinforcement learning library for research and education.
A Pythonic microframework for multi-armed bandit problems
Python implementation of UCB, EXP3 and Epsilon greedy algorithms
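As a minimal point of reference for the algorithms named in the entry above, epsilon-greedy arm selection can be sketched in a few lines of Python. This is an illustrative sketch, not code from any listed repository; the function names are hypothetical:

```python
import random

def epsilon_greedy(values, epsilon=0.1):
    """Pick an arm index: explore uniformly with probability epsilon,
    otherwise exploit the arm with the highest estimated mean reward."""
    if random.random() < epsilon:
        return random.randrange(len(values))
    return max(range(len(values)), key=lambda a: values[a])

def update(counts, values, arm, reward):
    """Incremental mean update for the chosen arm's reward estimate."""
    counts[arm] += 1
    values[arm] += (reward - values[arm]) / counts[arm]
```

UCB and EXP3 differ only in the selection rule (a confidence bonus added to each mean, or a softmax over cumulative estimated rewards, respectively); the bookkeeping loop is the same.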
Simulation code for the paper: [Wang2021] Wenbo Wang, Amir Leshem, Dusit Niyato and Zhu Han, "Decentralized Learning for Channel Allocation in IoT Networks over Unlicensed Bandwidth as a Contextual Multi-player Multi-armed Bandit Game", IEEE Transactions on Wireless Communications, 2021.
Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, Deep MAB.
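Of the methods the entry above covers, Thompson sampling is the most compact to write down. A minimal Beta-Bernoulli sketch in Python (illustrative only, not code from the repository; names are hypothetical):

```python
import random

def thompson_sample(successes, failures):
    """Beta-Bernoulli Thompson sampling: draw one posterior sample per arm
    from Beta(successes + 1, failures + 1) and pull the arm with the largest draw."""
    samples = [random.betavariate(s + 1, f + 1)
               for s, f in zip(successes, failures)]
    return max(range(len(samples)), key=lambda a: samples[a])
```

After each pull, increment `successes[arm]` on reward 1 and `failures[arm]` on reward 0; the posterior sampling then balances exploration and exploitation automatically.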
Bayesian Optimization for Categorical and Continuous Inputs
Online Ranking with Multi-Armed-Bandits
Implementations of basic concepts under the reinforcement learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
Decentralized Intelligent Resource Allocation for LoRaWAN Networks
A beer recommendation system that uses a multi-armed bandit approach to address the cold-start problem
[NeurIPS 2022] Supervising the Multi-Fidelity Race of Hyperparameter Configurations
Implementation of the X-armed Bandits algorithm, as detailed in the paper "X-armed Bandits" (Bubeck et al., 2011).
A multi-armed bandit (MAB) simulation library in Python
Implementations of the unordered- and ordered-slate bandit algorithms described in the paper "Non-Stochastic Bandit Slate Problems" (Kale et al., 2010).
An improved version of the TuRBO (trust-region Bayesian optimization) algorithm for the black-box optimization competition held at NeurIPS 2020
A multi-armed bandit method for accurately estimating the largest parameter in a set of candidates.