Python utilities to compute a lower bound of the expected sample complexity to identify the best arm in a bandit model
Randomized Greedy Learning Under Full-bandit Feedback
This repository contains code for the course CS780: Deep Reinforcement Learning
The official code repo for HyperAgent for neural bandits and GPT-HyperAgent for content moderation.
An Implementation of the N-Tuple Bandits Evolutionary Algorithm.
🎩🤠 Some bandit algorithms in TypeScript
Official code for an ICML 2024 paper
Code repository for the paper No-Regret Approximate Inference via Bayesian Optimisation, published at UAI 2021
An implementation of the lab assignments (TMEs) from the Reinforcement Learning course given at Sorbonne University.
Implementation of greedy, ε-greedy, and softmax methods for the n-armed bandit problem
An implementation of the matching bandit algorithm from https://proceedings.mlr.press/v139/sentenac21a.html.
Vectorized bandit algorithms implemented in NumPy and Cython
Today I Learned - Reinforcement Learning
Several multi-armed bandit strategies with additional holding option for smoother exploration.
Ad click-through rate optimization using Thompson sampling
A collection of Google Colab notebooks with educational material about bandits and their variations
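Several entries above implement classic multi-armed bandit strategies such as ε-greedy and Thompson sampling. As a minimal, illustrative sketch (not taken from any of the listed repositories; all names and parameters here are our own), both can be written for a Bernoulli bandit in a few lines of standard-library Python:

```python
import random

def eps_greedy(probs, steps=5000, eps=0.1, seed=0):
    """epsilon-greedy on a Bernoulli bandit with arm success rates `probs`.
    Returns (empirical mean estimates, pull counts)."""
    rng = random.Random(seed)
    k = len(probs)
    counts = [0] * k
    values = [0.0] * k
    for _ in range(steps):
        if rng.random() < eps:
            arm = rng.randrange(k)                       # explore uniformly
        else:
            arm = max(range(k), key=lambda a: values[a])  # exploit current best
        reward = 1.0 if rng.random() < probs[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean
    return values, counts

def thompson(probs, steps=5000, seed=0):
    """Beta-Bernoulli Thompson sampling.
    Returns per-arm posterior counts (successes, failures)."""
    rng = random.Random(seed)
    k = len(probs)
    wins = [1] * k    # Beta(1, 1) uniform prior
    losses = [1] * k
    for _ in range(steps):
        # Sample a plausible mean for each arm and play the argmax
        arm = max(range(k), key=lambda a: rng.betavariate(wins[a], losses[a]))
        if rng.random() < probs[arm]:
            wins[arm] += 1
        else:
            losses[arm] += 1
    return wins, losses
```

With enough steps, both strategies concentrate their pulls on the arm with the highest success probability; the repositories above extend these ideas to contextual, combinatorial, and neural variants.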