Non-Stationary Latent Bandits

Hong, Joey; Kveton, Branislav; Zaheer, Manzil; Chow, Yinlam; Ahmed, Amr; Ghavamzadeh, Mohammad; Boutilier, Craig

Computer Science > Machine Learning

arXiv:2012.00386 (cs)

[Submitted on 1 Dec 2020]

Title:Non-Stationary Latent Bandits

Authors:Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Mohammad Ghavamzadeh, Craig Boutilier

View PDF

Abstract:Users of recommender systems often behave in a non-stationary fashion, due to their evolving preferences and tastes over time. In this work, we propose a practical approach for fast personalization to non-stationary users. The key idea is to frame this problem as a latent bandit, where the prototypical models of user behavior are learned offline and the latent state of the user is inferred online from its interactions with the models. We call this problem a non-stationary latent bandit. We propose Thompson sampling algorithms for regret minimization in non-stationary latent bandits, analyze them, and evaluate them on a real-world dataset. The main strength of our approach is that it can be combined with rich offline-learned models, which can be misspecified, and are subsequently fine-tuned online using posterior sampling. In this way, we naturally combine the strengths of offline and online learning.

Comments:	15 pages, 4 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2012.00386 [cs.LG]
	(or arXiv:2012.00386v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2012.00386

Submission history

From: Joey Hong [view email]
[v1] Tue, 1 Dec 2020 10:31:57 UTC (303 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-12

Change to browse by:

cs
cs.AI

References & Citations

DBLP - CS Bibliography

listing | bibtex

Joey Hong
Branislav Kveton
Manzil Zaheer
Yinlam Chow
Amr Ahmed

…

export BibTeX citation

Computer Science > Machine Learning

Title:Non-Stationary Latent Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Non-Stationary Latent Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators