Skip to main content

Showing 1–3 of 3 results for author: Cherkaoui, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2309.08710  [pdf, other

    cs.LG stat.ML

    Clustered Multi-Agent Linear Bandits

    Authors: Hamza Cherkaoui, Merwan Barlier, Igor Colin

    Abstract: We address in this paper a particular instance of the multi-agent linear stochastic bandit problem, called clustered multi-agent linear bandits. In this setting, we propose a novel algorithm leveraging an efficient collaboration between the agents in order to accelerate the overall optimization problem. In this contribution, a network controller is responsible for estimating the underlying cluster… ▽ More

    Submitted 30 October, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 20 pages, 10 figures

  2. arXiv:2309.08709  [pdf, other

    stat.ML cs.LG

    Price of Safety in Linear Best Arm Identification

    Authors: Xuedong Shang, Igor Colin, Merwan Barlier, Hamza Cherkaoui

    Abstract: We introduce the safe best-arm identification framework with linear feedback, where the agent is subject to some stage-wise safety constraint that linearly depends on an unknown parameter vector. The agent must take actions in a conservative way so as to ensure that the safety constraint is not violated with high probability at each round. Ways of leveraging the linear structure for ensuring safet… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 20 pages, 1 figures

  3. arXiv:2010.09545  [pdf, other

    math.OC stat.ML

    Learning to solve TV regularized problems with unrolled algorithms

    Authors: Hamza Cherkaoui, Jeremias Sulam, Thomas Moreau

    Abstract: Total Variation (TV) is a popular regularization strategy that promotes piece-wise constant signals by constraining the $\ell_1$-norm of the first order derivative of the estimated signal. The resulting optimization problem is usually solved using iterative algorithms such as proximal gradient descent, primal-dual algorithms or ADMM. However, such methods can require a very large number of iterati… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: accepted to NeurIPS 2020