A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits

Lee, Junghyun; Yun, Se-Young; Jun, Kwang-Sung

arXiv:2407.13977 (stat)

[Submitted on 19 Jul 2024 (v1), last revised 31 Oct 2024 (this version, v2)]

Abstract: We present a unified likelihood ratio-based confidence sequence (CS) for any (self-concordant) generalized linear model (GLM) that is guaranteed to be convex and numerically tight. We show that this CS is on par with or improves upon known CSs for various GLMs, including Gaussian, Bernoulli, and Poisson. In particular, for the first time, our CS for Bernoulli has a $\mathrm{poly}(S)$-free radius, where $S$ is the norm of the unknown parameter. Our first technical novelty is its derivation, which utilizes a time-uniform PAC-Bayesian bound with a uniform prior/posterior, despite the latter being a rather unpopular choice for deriving CSs. As a direct application of our new CS, we propose a simple and natural optimistic algorithm called OFUGLB, applicable to any generalized linear bandit (GLB; Filippi et al. (2010)). Our analysis shows that the celebrated optimistic approach simultaneously attains state-of-the-art regrets for various self-concordant (not necessarily bounded) GLBs, and even $\mathrm{poly}(S)$-free regret for bounded GLBs, including logistic bandits. The regret analysis, our second technical novelty, follows from combining our new CS with a new proof technique that completely avoids the previously widely used self-concordant control lemma (Faury et al., 2020, Lemma 9). Numerically, OFUGLB outperforms or is on par with prior algorithms for logistic bandits.
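
To make the idea of a likelihood ratio-based confidence set concrete, the following is a minimal sketch for the Bernoulli (logistic) case. It assumes the set takes the usual sublevel form $\{\theta : L_t(\theta) - L_t(\hat\theta_t) \le \beta^2\}$, where $L_t$ is the negative log-likelihood and $\hat\theta_t$ is a norm-constrained maximum likelihood estimate. The function names, the toy data, and the radius argument `beta_sq` are all hypothetical; in particular, `beta_sq` is a user-supplied placeholder and not the radius derived in the paper.

```python
import math

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def log_loss(theta, xs, ys):
    """Negative log-likelihood of a logistic (Bernoulli GLM) model."""
    total = 0.0
    for x, y in zip(xs, ys):
        z = sum(t * xi for t, xi in zip(theta, x))
        # log(1 + exp(z)) computed stably, minus the linear term y * z
        total += max(z, 0.0) + math.log1p(math.exp(-abs(z))) - y * z
    return total

def fit_constrained_mle(xs, ys, dim, norm_bound, steps=2000, lr=0.1):
    """Projected gradient descent on the log loss over ||theta|| <= norm_bound."""
    theta = [0.0] * dim
    for _ in range(steps):
        grad = [0.0] * dim
        for x, y in zip(xs, ys):
            z = sum(t * xi for t, xi in zip(theta, x))
            residual = sigmoid(z) - y
            for j in range(dim):
                grad[j] += residual * x[j]
        theta = [t - lr * g / len(xs) for t, g in zip(theta, grad)]
        norm = math.sqrt(sum(t * t for t in theta))
        if norm > norm_bound:
            # Project back onto the Euclidean ball of radius norm_bound
            theta = [t * norm_bound / norm for t in theta]
    return theta

def in_confidence_set(theta, theta_hat, xs, ys, beta_sq):
    """Membership test: log-likelihood ratio at theta is at most beta_sq.

    beta_sq is a placeholder radius (an assumption of this sketch).
    """
    return log_loss(theta, xs, ys) - log_loss(theta_hat, xs, ys) <= beta_sq

# Toy data (hypothetical): six observations in two dimensions.
xs = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]
ys = [1, 0, 1, 1, 1, 0]
theta_hat = fit_constrained_mle(xs, ys, dim=2, norm_bound=2.0)
```

Because the logistic log loss is convex in `theta`, any set defined this way is a sublevel set of a convex function and is therefore convex, which is the property the abstract highlights. With the toy data above, `in_confidence_set(theta_hat, theta_hat, xs, ys, 0.0)` holds trivially, while the zero vector is excluded at radius zero because its loss is strictly larger than that of `theta_hat`.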

Comments:	39 pages, 2 figures, 2 tables; Accepted to the 38th Conference on Neural Information Processing Systems (NeurIPS 2024) (ver2: major revision, including new experiments, reorganization, fixing typos in the proofs of ver1, etc)
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2407.13977 [stat.ML]
	(or arXiv:2407.13977v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2407.13977

Submission history

From: Junghyun Lee
[v1] Fri, 19 Jul 2024 02:06:08 UTC (910 KB)
[v2] Thu, 31 Oct 2024 06:14:52 UTC (26,709 KB)
