multi-armed-bandits

Development of algorithms for reinforcement learning. Specifically, software implementation of the algorithms and policies described in the paper Batched Multi-armed Bandits Problems, by Zijun Gao, Yanjun Han, Zhimei Ren, Zhengqing Zhou.

python reinforcement-learning multi-armed-bandits

Updated Jul 22, 2020
Python

diyabodiwala / FlicksMAB

Star

FlicksMAB is a movie recommendation system that leverages the power of multi-armed bandits (MAB) to personalize movie suggestions for users. Built using PyTorch, this system uses the MovieLens 100K dataset to learn user preferences and recommend movies that are likely to engage them.

deep-learning pytorch recommendation-system multi-armed-bandits movielens-dataset evaluation-metrics

Updated Jun 29, 2024
Python

Murtazali05 / Multi-armed-bandit

Star

Multi Armed Bandits implementation using the Jester Dataset

thompson-sampling ucb multi-armed-bandits e-greedy

Updated Apr 5, 2021
Python

Sudhansh6 / Intelligent-Learning-Agents

Star

A repository covering a range of topics from multi-arm bandits to reinforcement learning algorithms. Check out different applications of bandits, MDPs and RL algorithms along with theoretical aspects.

python reinforcement-learning markov-decision-processes multi-armed-bandits tile-coding sarsa-lambda open-gym-ai

Updated Oct 10, 2023
Python

elina-israyelyan / thompson-sampling

Star

Package to implement the Thompson Sampling algorithm.

thompson-sampling multi-armed-bandits multi-armed-bandit

Updated May 14, 2022
Python

MehranTaghian / prophet-inequlity-implementation

Star

Implementation of the prophet inequalities

multi-armed-bandits bandits prophet-inequality k-prophet

Updated Dec 11, 2021
Python

mobarski / kraken

Star

Contextual Bandit Engine

multi-armed-bandits multi-armed-bandit contextual-bandits multiarm-bandit multiarmed-bandits

Updated Aug 16, 2023
Python

ml4wifi-devs / mapc-mab

Star

IEEE 802.11bn Multi-AP Coordinated Spatial Reuse with Hierarchical Multi-Armed Bandits

machine-learning reinforcement-learning multi-armed-bandits coordinated-spatial-reuse

Updated May 11, 2024
Python

k123abc / content-recommender-design

Star

This repository addresses popular content recommender design for public transportation as discussed in my Ph.D. thesis titled as: "Popular Content Distribution in Public Transportation Using Artificial Intelligence Techniques". The code used for the entire content recommender design is provided twice using two programming languages, namely: Pyth…

artificial-intelligence collaborative-filtering recommender-systems multi-armed-bandits vehicular-networks vanets wireless-networks content-distribution ad-hoc-network

Updated Feb 16, 2021
Python

proceduralia / randomist

Star

Code for Policy Optimization as Online Learning with Mediator Feedback

thompson-sampling exploration mcmc multi-armed-bandits policy-optimization

Updated Dec 27, 2020
Python

sarthakmittal92 / multi-armed-bandits

Star

Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.

python thompson-sampling reinforcement-learning-algorithms ucb multi-armed-bandits bandits kl-ucb

Updated Oct 14, 2022
Python

royhzq / bayesian-ab-django

Star

An implementation of Bayesian AB testing framework in Django. Implements multi-armed bandit algorithms such as Thompson Sampling and UCB1. API for registering impressions/conversions implemented with django-rest framework

d3 django django-rest-framework multi-armed-bandits bayesian-statistics