This package provides a framework for developing and comparing various bandit algorithms:
- Uniform Strategy (picks an arm uniformly at random)
- ϵ-greedy (see the sketch after this list)
    - ϵ-greedy
    - ϵ_n-greedy
- Upper Confidence Bound Policies (UCB1 sketched below)
    - UCB1
    - UCB-Normal
    - UCB-V
    - Bayes-UCB (for Bernoulli rewards)
    - KL-UCB
    - Discounted-UCB
    - Sliding Window UCB
- Thompson Sampling (sketched below)
    - Thompson Sampling
    - Dynamic Thompson Sampling
    - Optimistic Thompson Sampling
    - TSNormal (Thompson Sampling for Gaussian-distributed rewards)
    - Restarting Thompson Sampling
    - TS with Gaussian Prior
- EXP3 (sketched below)
    - EXP3
    - EXP3.1
    - EXP3-IX
- SoftMax
- REXP3
- Gradient Bandit
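For orientation, here is a minimal ϵ-greedy agent in Python. This is an illustrative sketch of the algorithm itself, not this package's API; the class and method names (`EpsilonGreedy`, `select_arm`, `update`) are hypothetical.

```python
import random

class EpsilonGreedy:
    """epsilon-greedy: explore a random arm with probability eps,
    otherwise exploit the arm with the highest empirical mean reward.
    (Illustrative sketch; not this package's implementation.)"""

    def __init__(self, n_arms, eps=0.1):
        self.eps = eps
        self.counts = [0] * n_arms    # number of pulls per arm
        self.values = [0.0] * n_arms  # running mean reward per arm

    def select_arm(self):
        if random.random() < self.eps:                  # explore
            return random.randrange(len(self.counts))
        return max(range(len(self.values)),             # exploit
                   key=self.values.__getitem__)

    def update(self, arm, reward):
        self.counts[arm] += 1
        # incremental update of the empirical mean
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]
```

The ϵ_n-greedy variant listed above differs only in that ϵ decays with time (e.g. ϵ_t = min(1, c/t) for a constant c), so exploration fades as the mean estimates sharpen.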
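UCB1 replaces randomized exploration with an optimism bonus: it plays the arm maximizing the empirical mean plus sqrt(2 ln t / n_i). A minimal sketch under the same hypothetical interface:

```python
import math

class UCB1:
    """UCB1: pull each arm once, then pick the arm maximizing
    mean_i + sqrt(2 * ln(t) / n_i). (Illustrative sketch.)"""

    def __init__(self, n_arms):
        self.counts = [0] * n_arms
        self.values = [0.0] * n_arms
        self.t = 0  # total number of rounds so far

    def select_arm(self):
        self.t += 1
        for arm, n in enumerate(self.counts):
            if n == 0:              # initialization: try every arm once
                return arm
        ucb = [v + math.sqrt(2 * math.log(self.t) / n)
               for v, n in zip(self.values, self.counts)]
        return ucb.index(max(ucb))

    def update(self, arm, reward):
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]
```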
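Thompson Sampling for Bernoulli rewards keeps a Beta posterior per arm, samples a mean from each posterior, and plays the argmax. A minimal sketch, again not this package's implementation:

```python
import random

class ThompsonSamplingBernoulli:
    """Thompson Sampling for Bernoulli rewards with a Beta(1, 1) prior:
    sample a mean from each arm's posterior, play the argmax, and do a
    conjugate Beta update on the observed reward. (Illustrative sketch.)"""

    def __init__(self, n_arms):
        self.alpha = [1.0] * n_arms  # 1 + number of successes
        self.beta = [1.0] * n_arms   # 1 + number of failures

    def select_arm(self):
        samples = [random.betavariate(a, b)
                   for a, b in zip(self.alpha, self.beta)]
        return samples.index(max(samples))

    def update(self, arm, reward):   # reward in {0, 1}
        self.alpha[arm] += reward
        self.beta[arm] += 1 - reward
```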
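EXP3 targets the adversarial setting: it maintains exponential weights over arms, mixes them with uniform exploration, and updates the played arm with an importance-weighted reward estimate. A minimal sketch, assuming rewards in [0, 1]:

```python
import math
import random

class EXP3:
    """EXP3: exponential weights mixed with uniform exploration;
    the chosen arm's weight is boosted by exp(gamma * xhat / K),
    where xhat = reward / p(arm). (Illustrative sketch.)"""

    def __init__(self, n_arms, gamma=0.1):
        self.gamma = gamma
        self.weights = [1.0] * n_arms

    def _probs(self):
        total = sum(self.weights)
        k = len(self.weights)
        return [(1 - self.gamma) * w / total + self.gamma / k
                for w in self.weights]

    def select_arm(self):
        p = self._probs()
        return random.choices(range(len(p)), weights=p)[0]

    def update(self, arm, reward):   # reward assumed in [0, 1]
        p = self._probs()
        xhat = reward / p[arm]       # importance-weighted reward estimate
        self.weights[arm] *= math.exp(self.gamma * xhat / len(self.weights))
```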
The package also provides the following arm (reward) models for simulating bandit environments; a sketch of two of them follows the list:
- Bernoulli
- Beta
- Normal
- Sinusoidal (without noise)
- Pulse (without noise)
- Square
- Variational (without noise)
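To make the arm-model list concrete, here is a sketch of a stationary (Bernoulli) and a non-stationary, noiseless (Sinusoidal) arm, plus a toy experiment loop wiring an arm set to one of the agent sketches above. The `pull` interface is an assumption for illustration, not necessarily this package's interface.

```python
import math
import random

class BernoulliArm:
    """Stationary arm: each pull returns 1 with probability p, else 0."""

    def __init__(self, p):
        self.p = p

    def pull(self):
        return 1 if random.random() < self.p else 0

class SinusoidalArm:
    """Non-stationary, noiseless arm: the reward follows a sinusoid in time."""

    def __init__(self, period):
        self.period = period
        self.t = 0

    def pull(self):
        self.t += 1
        return 0.5 * (1 + math.sin(2 * math.pi * self.t / self.period))

# Toy experiment loop, reusing the ThompsonSamplingBernoulli sketch above.
arms = [BernoulliArm(p) for p in (0.2, 0.5, 0.8)]
agent = ThompsonSamplingBernoulli(len(arms))
for _ in range(1000):
    arm = agent.select_arm()
    agent.update(arm, arms[arm].pull())
```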