Improved Confidence Bounds for the Linear Logistic Model and Applications to Linear Bandits

Jun, Kwang-Sung; Jain, Lalit; Nassif, Houssam

Statistics > Machine Learning

arXiv:2011.11222v1 (stat)

[Submitted on 23 Nov 2020 (this version), latest version 18 Mar 2021 (v2)]

Title:Improved Confidence Bounds for the Linear Logistic Model and Applications to Linear Bandits

Authors:Kwang-Sung Jun, Lalit Jain, Houssam Nassif

View PDF

Abstract:We propose improved fixed-design confidence bounds for the linear logistic model. Our bounds significantly improve upon the state-of-the-art bounds of Li et al. (2017) by leveraging the self-concordance of the logistic loss inspired by Faury et al. (2020). Specifically, our confidence width does not scale with the problem dependent parameter $1/\kappa$, where $\kappa$ is the worst-case variance of an arm reward. At worse, $\kappa$ scales exponentially with the norm of the unknown linear parameter $\theta^*$. Instead, our bound scales directly on the local variance induced by $\theta^*$. We present two applications of our novel bounds on two logistic bandit problems: regret minimization and pure exploration. Our analysis shows that the new confidence bounds improve upon previous state-of-the-art performance guarantees.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2011.11222 [stat.ML]
	(or arXiv:2011.11222v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2011.11222

Submission history

From: Lalit Jain [view email]
[v1] Mon, 23 Nov 2020 05:44:26 UTC (675 KB)
[v2] Thu, 18 Mar 2021 04:45:43 UTC (1,096 KB)

Statistics > Machine Learning

Title:Improved Confidence Bounds for the Linear Logistic Model and Applications to Linear Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Improved Confidence Bounds for the Linear Logistic Model and Applications to Linear Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators