Skip to main content

Showing 1–42 of 42 results for author: Salmon, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.03356  [pdf, other

    cs.LG stat.AP

    Cooperative learning of Pl@ntNet's Artificial Intelligence algorithm: how does it work and how can we improve it?

    Authors: Tanguy Lefort, Antoine Affouard, Benjamin Charlier, Jean-Christophe Lombardo, Mathias Chouet, Hervé Goëau, Joseph Salmon, Pierre Bonnet, Alexis Joly

    Abstract: Deep learning models for plant species identification rely on large annotated datasets. The PlantNet system enables global data collection by allowing users to upload and annotate plant observations, leading to noisy labels due to diverse user skills. Achieving consensus is crucial for training, but the vast scale of collected data makes traditional label aggregation strategies challenging. Existi… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2209.15380  [pdf, other

    cs.LG cs.HC stat.ML

    Identify ambiguous tasks combining crowdsourced labels by weighting Areas Under the Margin

    Authors: Tanguy Lefort, Benjamin Charlier, Alexis Joly, Joseph Salmon

    Abstract: In supervised learning - for instance in image classification - modern massive datasets are commonly labeled by a crowd of workers. The obtained labels in this crowdsourcing setting are then aggregated for training, generally leveraging a per-worker trust score. Yet, such workers oriented approaches discard the tasks' ambiguity. Ambiguous tasks might fool expert workers, which is often harmful for… ▽ More

    Submitted 30 November, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    MSC Class: 68T05 ACM Class: I.2.6

  3. arXiv:2207.01560  [pdf, other

    cs.LG cs.CR stat.ML

    High-Dimensional Private Empirical Risk Minimization by Greedy Coordinate Descent

    Authors: Paul Mangold, Aurélien Bellet, Joseph Salmon, Marc Tommasi

    Abstract: In this paper, we study differentially private empirical risk minimization (DP-ERM). It has been shown that the worst-case utility of DP-ERM reduces polynomially as the dimension increases. This is a major obstacle to privately learning large machine learning models. In high dimension, it is common for some model's parameters to carry more information than others. To exploit this, we propose a dif… ▽ More

    Submitted 9 April, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

  4. arXiv:2206.13424  [pdf, other

    cs.LG math.OC stat.ML

    Benchopt: Reproducible, efficient and collaborative optimization benchmarks

    Authors: Thomas Moreau, Mathurin Massias, Alexandre Gramfort, Pierre Ablin, Pierre-Antoine Bannier, Benjamin Charlier, Mathieu Dagréou, Tom Dupré la Tour, Ghislain Durif, Cassio F. Dantas, Quentin Klopfenstein, Johan Larsson, En Lai, Tanguy Lefort, Benoit Malézieux, Badr Moufad, Binh T. Nguyen, Alain Rakotomamonjy, Zaccharie Ramzi, Joseph Salmon, Samuel Vaiter

    Abstract: Numerical validation is at the core of machine learning research as it allows to assess the actual impact of new methods, and to confirm the agreement between theory and practice. Yet, the rapid development of the field poses several challenges: researchers are confronted with a profusion of methods to compare, limited transparency and consensus on best practices, as well as tedious re-implementat… ▽ More

    Submitted 28 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted in proceedings of NeurIPS 22; Benchopt library documentation is available at https://benchopt.github.io/

  5. arXiv:2202.02193  [pdf, other

    stat.ML cs.LG

    Stochastic smoothing of the top-K calibrated hinge loss for deep imbalanced classification

    Authors: Camille Garcin, Maximilien Servajean, Alexis Joly, Joseph Salmon

    Abstract: In modern classification tasks, the number of labels is getting larger and larger, as is the size of the datasets encountered in practice. As the number of classes increases, class ambiguity and class imbalance become more and more problematic to achieve high top-1 accuracy. Meanwhile, Top-K metrics (metrics allowing K guesses) have become popular, especially for performance reporting. Yet, propos… ▽ More

    Submitted 17 July, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:7208-7222, 2022

  6. arXiv:2112.07990  [pdf, other

    math.OC stat.ML

    Supervised learning of analysis-sparsity priors with automatic differentiation

    Authors: Hashem Ghanem, Joseph Salmon, Nicolas Keriven, Samuel Vaiter

    Abstract: Sparsity priors are commonly used in denoising and image reconstruction. For analysis-type priors, a dictionary defines a representation of signals that is likely to be sparse. In most situations, this dictionary is not known, and is to be recovered from pairs of ground-truth signals and measurements, by minimizing the reconstruction error. This defines a hierarchical optimization problem, which c… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 5 pages, 4 figures

  7. arXiv:2110.11688  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially Private Coordinate Descent for Composite Empirical Risk Minimization

    Authors: Paul Mangold, Aurélien Bellet, Joseph Salmon, Marc Tommasi

    Abstract: Machine learning models can leak information about the data used to train them. To mitigate this issue, Differentially Private (DP) variants of optimization algorithms like Stochastic Gradient Descent (DP-SGD) have been designed to trade-off utility for privacy in Empirical Risk Minimization (ERM) problems. In this paper, we propose Differentially Private proximal Coordinate Descent (DP-CD), a new… ▽ More

    Submitted 21 October, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: 36 pages, 3 figures

  8. arXiv:2106.14122  [pdf, other

    stat.ML cs.LG

    Score-Based Change Detection for Gradient-Based Learning Machines

    Authors: Lang Liu, Joseph Salmon, Zaid Harchaoui

    Abstract: The widespread use of machine learning algorithms calls for automatic change detection algorithms to monitor their behavior over time. As a machine learning algorithm learns from a continuous, possibly evolving, stream of data, it is desirable and often critical to supplement it with a companion change detection algorithm to facilitate its monitoring and control. We present a generic score-based c… ▽ More

    Submitted 26 June, 2021; originally announced June 2021.

  9. arXiv:2106.02590  [pdf, other

    stat.ME math.ST stat.ML

    Spatially relaxed inference on high-dimensional linear models

    Authors: Jérôme-Alexis Chevalier, Tuan-Binh Nguyen, Bertrand Thirion, Joseph Salmon

    Abstract: We consider the inference problem for high-dimensional linear models, when covariates have an underlying spatial organization reflected in their correlation. A typical example of such a setting is high-resolution imaging, in which neighboring pixels are usually very similar. Accurate point and confidence intervals estimation is not possible in this context with many more covariates than samples, f… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  10. arXiv:2105.01637  [pdf, other

    stat.ML cs.LG math.OC

    Implicit differentiation for fast hyperparameter selection in non-smooth convex learning

    Authors: Quentin Bertrand, Quentin Klopfenstein, Mathurin Massias, Mathieu Blondel, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

    Abstract: Finding the optimal hyperparameters of a model can be cast as a bilevel optimization problem, typically solved using zero-order techniques. In this work we study first-order methods when the inner optimization problem is convex but non-smooth. We show that the forward-mode differentiation of proximal gradient descent and proximal coordinate descent yield sequences of Jacobians converging toward th… ▽ More

    Submitted 8 August, 2022; v1 submitted 4 May, 2021; originally announced May 2021.

  11. arXiv:2010.11825  [pdf, other

    stat.ML cs.LG math.OC

    Model identification and local linear convergence of coordinate descent

    Authors: Quentin Klopfenstein, Quentin Bertrand, Alexandre Gramfort, Joseph Salmon, Samuel Vaiter

    Abstract: For composite nonsmooth optimization problems, Forward-Backward algorithm achieves model identification (e.g. support identification for the Lasso) after a finite number of iterations, provided the objective function is regular enough. Results concerning coordinate descent are scarcer and model identification has only been shown for specific estimators, the support-vector machine for instance. In… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  12. arXiv:2009.14310  [pdf, other

    stat.ML cs.LG stat.AP

    Statistical control for spatio-temporal MEG/EEG source imaging with desparsified multi-task Lasso

    Authors: Jérôme-Alexis Chevalier, Alexandre Gramfort, Joseph Salmon, Bertrand Thirion

    Abstract: Detecting where and when brain regions activate in a cognitive task or in a given clinical condition is the promise of non-invasive techniques like magnetoencephalography (MEG) or electroencephalography (EEG). This problem, referred to as source localization, or source imaging, poses however a high-dimensional statistical inference challenge. While sparsity promoting regularizations have been prop… ▽ More

    Submitted 25 November, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: 21 pages

  13. arXiv:2009.02709  [pdf, other

    stat.ML cs.LG math.OC stat.CO

    Screening Rules and its Complexity for Active Set Identification

    Authors: Eugene Ndiaye, Olivier Fercoq, Joseph Salmon

    Abstract: Screening rules were recently introduced as a technique for explicitly identifying active structures such as sparsity, in optimization problem arising in machine learning. This has led to new methods of acceleration based on a substantial dimension reduction. We show that screening rules stem from a combination of natural properties of subdifferential sets and optimality conditions, and can hence… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

  14. arXiv:2006.13533  [pdf, other

    cs.LG stat.ML

    Provably Convergent Working Set Algorithm for Non-Convex Regularized Regression

    Authors: Alain Rakotomamonjy, Rémi Flamary, Gilles Gasso, Joseph Salmon

    Abstract: Owing to their statistical properties, non-convex sparse regularizers have attracted much interest for estimating a sparse linear model from high dimensional data. Given that the solution is sparse, for accelerating convergence, a working set strategy addresses the optimization problem through an iterative algorithm by incre-menting the number of variables to optimize until the identification of t… ▽ More

    Submitted 20 October, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  15. arXiv:2002.08943  [pdf, other

    stat.ML cs.LG

    Implicit differentiation of Lasso-type models for hyperparameter optimization

    Authors: Quentin Bertrand, Quentin Klopfenstein, Mathieu Blondel, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

    Abstract: Setting regularization parameters for Lasso-type estimators is notoriously difficult, though crucial in practice. The most popular hyperparameter optimization approach is grid-search using held-out validation data. Grid-search however requires to choose a predefined grid for each parameter, which scales exponentially in the number of parameters. Another approach is to cast hyperparameter optimizat… ▽ More

    Submitted 3 September, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

  16. arXiv:2001.05401  [pdf, other

    stat.ML cs.LG math.OC

    Support recovery and sup-norm convergence rates for sparse pivotal estimation

    Authors: Mathurin Massias, Quentin Bertrand, Alexandre Gramfort, Joseph Salmon

    Abstract: In high dimensional sparse regression, pivotal estimators are estimators for which the optimal regularization parameter is independent of the noise level. The canonical pivotal estimator is the square-root Lasso, formulated along with its derivatives as a "non-smooth + non-smooth" optimization problem. Modern techniques to solve these include smoothing the datafitting term, to benefit from fast ef… ▽ More

    Submitted 3 September, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

  17. arXiv:1907.05830  [pdf, other

    stat.ML cs.LG

    Dual Extrapolation for Sparse Generalized Linear Models

    Authors: Mathurin Massias, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

    Abstract: Generalized Linear Models (GLM) form a wide class of regression and classification models, where prediction is a function of a linear combination of the input variables. For statistical inference in high dimension, sparsity inducing regularizations have proven to be useful while offering statistical guarantees. However, solving the resulting optimization problems can be challenging: even for popul… ▽ More

    Submitted 24 August, 2022; v1 submitted 12 July, 2019; originally announced July 2019.

  18. arXiv:1902.06125  [pdf, other

    cs.LG stat.ML

    Screening Rules for Lasso with Non-Convex Sparse Regularizers

    Authors: Alain Rakotomamonjy, Gilles Gasso, Joseph Salmon

    Abstract: Leveraging on the convexity of the Lasso problem , screening rules help in accelerating solvers by discarding irrelevant variables, during the optimization process. However, because they provide better theoretical guarantees in identifying relevant variables, several non-convex regularizers for the Lasso have been proposed in the literature. This work is the first that introduces a screening rule… ▽ More

    Submitted 19 February, 2019; v1 submitted 16 February, 2019; originally announced February 2019.

  19. arXiv:1902.02509  [pdf, other

    stat.ML cs.LG math.OC stat.AP

    Handling correlated and repeated measurements with the smoothed multivariate square-root Lasso

    Authors: Quentin Bertrand, Mathurin Massias, Alexandre Gramfort, Joseph Salmon

    Abstract: Sparsity promoting norms are frequently used in high dimensional regression. A limitation of such Lasso-type estimators is that the optimal regularization parameter depends on the unknown noise level. Estimators such as the concomitant Lasso address this dependence by jointly estimating the noise level and the regression coefficients. Additionally, in many applications, the data is obtained by ave… ▽ More

    Submitted 3 September, 2020; v1 submitted 7 February, 2019; originally announced February 2019.

  20. arXiv:1902.00071  [pdf, other

    math.OC cs.LG stat.ML

    Optimal mini-batch and step sizes for SAGA

    Authors: Nidham Gazagnadou, Robert M. Gower, Joseph Salmon

    Abstract: Recently it has been shown that the step sizes of a family of variance reduced gradient methods called the JacSketch methods depend on the expected smoothness constant. In particular, if this expected smoothness constant could be calculated a priori, then one could safely set much larger step sizes which would result in a much faster convergence rate. We fill in this gap, and provide simple closed… ▽ More

    Submitted 18 September, 2019; v1 submitted 31 January, 2019; originally announced February 2019.

    Comments: 34 pages, 27 figures

    MSC Class: 90C15; 90C25; 68W20

  21. arXiv:1810.05471  [pdf, other

    stat.ML cs.LG math.OC

    Safe Grid Search with Optimal Complexity

    Authors: Eugene Ndiaye, Tam Le, Olivier Fercoq, Joseph Salmon, Ichiro Takeuchi

    Abstract: Popular machine learning estimators involve regularization parameters that can be challenging to tune, and standard strategies rely on grid search for this task. In this paper, we revisit the techniques of approximating the regularization path up to predefined tolerance $ε$ in a unified framework and show that its complexity is $O(1/\sqrt[d]ε)$ for uniformly convex loss of order $d \geq 2$ and… ▽ More

    Submitted 27 May, 2019; v1 submitted 12 October, 2018; originally announced October 2018.

    Journal ref: International Conference on Machine Learning, 2019

  22. arXiv:1806.05829  [pdf, other

    stat.AP

    Statistical Inference with Ensemble of Clustered Desparsified Lasso

    Authors: Jérôme-Alexis Chevalier, Joseph Salmon, Bertrand Thirion

    Abstract: Medical imaging involves high-dimensional data, yet their acquisition is obtained for limited samples. Multivariate predictive models have become popular in the last decades to fit some external variables from imaging data, and standard algorithms yield point estimates of the model parameters. It is however challenging to attribute confidence to these parameter estimates, which makes solutions har… ▽ More

    Submitted 15 June, 2018; originally announced June 2018.

  23. arXiv:1802.07481  [pdf, other

    stat.ML

    Celer: a Fast Solver for the Lasso with Dual Extrapolation

    Authors: Mathurin Massias, Alexandre Gramfort, Joseph Salmon

    Abstract: Convex sparsity-inducing regularizations are ubiquitous in high-dimensional machine learning, but solving the resulting optimization problems can be slow. To accelerate solvers, state-of-the-art approaches consist in reducing the size of the optimization problem at hand. In the context of regression, this can be achieved either by discarding irrelevant features (screening techniques) or by priorit… ▽ More

    Submitted 6 June, 2018; v1 submitted 21 February, 2018; originally announced February 2018.

  24. A hierarchical Bayesian perspective on majorization-minimization for non-convex sparse regression: application to M/EEG source imaging

    Authors: Yousra Bekhti, Felix Lucka, Joseph Salmon, Alexandre Gramfort

    Abstract: Majorization-minimization (MM) is a standard iterative optimization technique which consists in minimizing a sequence of convex surrogate functionals. MM approaches have been particularly successful to tackle inverse problems and statistical machine learning problems where the regularization term is a sparsity-promoting concave function. However, due to non-convexity, the solution found by MM depe… ▽ More

    Submitted 6 June, 2018; v1 submitted 24 October, 2017; originally announced October 2017.

  25. arXiv:1707.05232  [pdf, other

    math.ST stat.AP

    On Lasso refitting strategies

    Authors: Evgenii Chzhen, Mohamed Hebiri, Joseph Salmon

    Abstract: A well-know drawback of l_1-penalized estimators is the systematic shrinkage of the large coefficients towards zero. A simple remedy is to treat Lasso as a model-selection procedure and to perform a second refitting step on the selected support. In this work we formalize the notion of refitting and provide oracle bounds for arbitrary refitting procedures of the Lasso solution. One of the most wide… ▽ More

    Submitted 12 November, 2018; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: revised version

  26. arXiv:1705.09778  [pdf, other

    stat.ML math.OC stat.AP

    Generalized Concomitant Multi-Task Lasso for sparse multimodal regression

    Authors: Mathurin Massias, Olivier Fercoq, Alexandre Gramfort, Joseph Salmon

    Abstract: In high dimension, it is customary to consider Lasso-type estimators to enforce sparsity. For standard Lasso theory to hold, the regularization parameter should be proportional to the noise level, yet the latter is generally unknown in practice. A possible remedy is to consider estimators, such as the Concomitant/Scaled Lasso, which jointly optimize over the regression coefficients as well as over… ▽ More

    Submitted 18 October, 2017; v1 submitted 27 May, 2017; originally announced May 2017.

  27. arXiv:1703.07285  [pdf, other

    stat.ML cs.LG math.OC stat.CO

    From safe screening rules to working sets for faster Lasso-type solvers

    Authors: Mathurin Massias, Alexandre Gramfort, Joseph Salmon

    Abstract: Convex sparsity-promoting regularizations are ubiquitous in modern statistical learning. By construction, they yield solutions with few non-zero coefficients, which correspond to saturated constraints in the dual optimization formulation. Working set (WS) strategies are generic optimization techniques that consist in solving simpler problems that only consider a subset of constraints, whose indice… ▽ More

    Submitted 1 May, 2017; v1 submitted 21 March, 2017; originally announced March 2017.

  28. arXiv:1703.04697  [pdf, other

    math.ST cs.LG stat.ML

    On the benefits of output sparsity for multi-label classification

    Authors: Evgenii Chzhen, Christophe Denis, Mohamed Hebiri, Joseph Salmon

    Abstract: The multi-label classification framework, where each observation can be associated with a set of labels, has generated a tremendous amount of attention over recent years. The modern multi-label problems are typically large-scale in terms of number of observations, features and labels, and the amount of labels can even be comparable with the amount of observations. In this context, different remedi… ▽ More

    Submitted 14 March, 2017; originally announced March 2017.

  29. arXiv:1612.03080  [pdf, ps, other

    stat.ML

    Characterizing the maximum parameter of the total-variation denoising through the pseudo-inverse of the divergence

    Authors: Charles-Alban Deledalle, Nicolas Papadakis, Joseph Salmon, Samuel Vaiter

    Abstract: We focus on the maximum regularization parameter for anisotropic total-variation denoising. It corresponds to the minimum value of the regularization parameter above which the solution remains constant. While this value is well know for the Lasso, such a critical value has not been investigated in details for the total-variation. Though, it is of importance when tuning the regularization parameter… ▽ More

    Submitted 8 December, 2016; originally announced December 2016.

  30. arXiv:1611.05780  [pdf, other

    stat.ML cs.LG math.OC stat.CO

    Gap Safe screening rules for sparsity enforcing penalties

    Authors: Eugene Ndiaye, Olivier Fercoq, Alexandre Gramfort, Joseph Salmon

    Abstract: In high dimensional regression settings, sparsity enforcing penalties have proved useful to regularize the data-fitting term. A recently introduced technique called screening rules propose to ignore some variables in the optimization leveraging the expected sparsity of the solutions and consequently leading to faster solvers. When the procedure is guaranteed not to discard variables wrongly the ru… ▽ More

    Submitted 27 December, 2017; v1 submitted 17 November, 2016; originally announced November 2016.

  31. arXiv:1606.05158  [pdf, other

    math.ST cs.CV stat.ML

    CLEAR: Covariant LEAst-square Re-fitting with applications to image restoration

    Authors: C-A. Deledalle, N. Papadakis, J. Salmon, S. Vaiter

    Abstract: In this paper, we propose a new framework to remove parts of the systematic errors affecting popular restoration algorithms, with a special focus for image processing tasks. Generalizing ideas that emerged for $\ell_1$ regularization, we develop an approach re-fitting the results of standard methods towards the input data. Total variation regularizations and non-local means are special cases of in… ▽ More

    Submitted 14 September, 2016; v1 submitted 16 June, 2016; originally announced June 2016.

  32. arXiv:1606.02702  [pdf, other

    stat.ML cs.LG math.OC

    Efficient Smoothed Concomitant Lasso Estimation for High Dimensional Regression

    Authors: Eugene Ndiaye, Olivier Fercoq, Alexandre Gramfort, Vincent Leclère, Joseph Salmon

    Abstract: In high dimensional settings, sparse structures are crucial for efficiency, both in term of memory, computation and performance. It is customary to consider $\ell_1$ penalty to enforce sparsity in such scenarios. Sparsity enforcing methods, the Lasso being a canonical example, are popular candidates to address high dimension. For efficiency, they rely on tuning a parameter trading data fitting ver… ▽ More

    Submitted 8 June, 2016; originally announced June 2016.

    MSC Class: 62J05; 90C25; 90C06

  33. arXiv:1606.02421  [pdf, other

    stat.ML cs.AI cs.DC cs.LG eess.SY

    Gossip Dual Averaging for Decentralized Optimization of Pairwise Functions

    Authors: Igor Colin, Aurélien Bellet, Joseph Salmon, Stéphan Clémençon

    Abstract: In decentralized networks (of sensors, connected objects, etc.), there is an important need for efficient algorithms to optimize a global cost function, for instance to learn a global model from the local data collected by each computing unit. In this paper, we address the problem of decentralized minimization of pairwise functions of the data points, where these points are distributed over the no… ▽ More

    Submitted 8 June, 2016; originally announced June 2016.

  34. arXiv:1602.06225  [pdf, other

    stat.ML cs.LG math.OC stat.CO

    GAP Safe Screening Rules for Sparse-Group-Lasso

    Authors: Eugene Ndiaye, Olivier Fercoq, Alexandre Gramfort, Joseph Salmon

    Abstract: In high dimensional settings, sparse structures are crucial for efficiency, either in term of memory, computation or performance. In some contexts, it is natural to handle more refined structures than pure sparsity, such as for instance group sparsity. Sparse-Group Lasso has recently been introduced in the context of linear regression to enforce sparsity both at the feature level and at the group… ▽ More

    Submitted 19 February, 2016; originally announced February 2016.

  35. arXiv:1511.05464  [pdf, other

    stat.ML cs.DC cs.LG eess.SY stat.CO

    Extending Gossip Algorithms to Distributed Estimation of U-Statistics

    Authors: Igor Colin, Aurélien Bellet, Joseph Salmon, Stéphan Clémençon

    Abstract: Efficient and robust algorithms for decentralized estimation in networks are essential to many distributed systems. Whereas distributed estimation of sample mean statistics has been the subject of a good deal of attention, computation of $U$-statistics, relying on more expensive averaging over pairs of observations, is a less investigated area. Yet, such data functionals are essential to describe… ▽ More

    Submitted 17 November, 2015; originally announced November 2015.

    Comments: to be presented at NIPS 2015

    MSC Class: 68Uxx; 62J15; 68Q32; 62-04;

  36. arXiv:1506.03736  [pdf, other

    stat.ML cs.LG math.OC stat.CO

    GAP Safe screening rules for sparse multi-task and multi-class models

    Authors: Eugene Ndiaye, Olivier Fercoq, Alexandre Gramfort, Joseph Salmon

    Abstract: High dimensional regression benefits from sparsity promoting regularizations. Screening rules leverage the known sparsity of the solution by ignoring some variables in the optimization, hence speeding up solvers. When the procedure is proven not to discard features wrongly the rules are said to be \emph{safe}. In this paper we derive new safe rules for generalized linear models regularized with… ▽ More

    Submitted 18 November, 2015; v1 submitted 11 June, 2015; originally announced June 2015.

    Comments: in Proceedings of the 29-th Conference on Neural Information Processing Systems (NIPS), 2015

    MSC Class: 68Uxx; 49N15; 62Jxx; 68Q32; 62-04

  37. arXiv:1505.03410  [pdf, other

    stat.ML cs.LG math.OC stat.CO

    Mind the duality gap: safer rules for the Lasso

    Authors: Olivier Fercoq, Alexandre Gramfort, Joseph Salmon

    Abstract: Screening rules allow to early discard irrelevant variables from the optimization in Lasso problems, or its derivatives, making solvers faster. In this paper, we propose new versions of the so-called $\textit{safe rules}$ for the Lasso. Based on duality gap considerations, our new rules create safe test regions whose diameters converge to zero, provided that one relies on a converging solver. This… ▽ More

    Submitted 3 December, 2015; v1 submitted 13 May, 2015; originally announced May 2015.

    Comments: erratum to ICML 2015, "The authors would like to thanks Jalal Fadili and Jingwei Liang for helping clarifying some misleading statements on the equicorrelation set"

    MSC Class: 68Uxx; 49N15; 62Jxx; 68Q32; 62-04

  38. arXiv:1412.2632  [pdf, ps, other

    math.ST stat.ML

    Probabilistic low-rank matrix completion on finite alphabets

    Authors: Jean Lafond, Olga Klopp, Eric Moulines, Jospeh Salmon

    Abstract: The task of reconstructing a matrix given a sample of observedentries is known as the matrix completion problem. It arises ina wide range of problems, including recommender systems, collaborativefiltering, dimensionality reduction, image processing, quantum physics or multi-class classificationto name a few. Most works have focused on recovering an unknown real-valued low-rankmatrix from randomly… ▽ More

    Submitted 8 December, 2014; originally announced December 2014.

    Comments: arXiv admin note: text overlap with arXiv:1408.6218

    Journal ref: NIPS, Dec 2014, Montreal, Canada

  39. arXiv:1410.5014  [pdf, other

    stat.ME math.ST

    Optimal Two-Step Prediction in Regression

    Authors: Didier Chételat, Johannes Lederer, Joseph Salmon

    Abstract: High-dimensional prediction typically comprises two steps: variable selection and subsequent least-squares refitting on the selected variables. However, the standard variable selection procedures, such as the lasso, hinge on tuning parameters that need to be calibrated. Cross-validation, the most popular calibration scheme, is computationally costly and lacks finite sample guarantees. In this pape… ▽ More

    Submitted 5 June, 2017; v1 submitted 18 October, 2014; originally announced October 2014.

  40. arXiv:1408.6218  [pdf, ps, other

    math.ST stat.ML

    Adaptive Multinomial Matrix Completion

    Authors: Olga Klopp, Jean Lafond, Eric Moulines, Joseph Salmon

    Abstract: The task of estimating a matrix given a sample of observed entries is known as the \emph{matrix completion problem}. Most works on matrix completion have focused on recovering an unknown real-valued low-rank matrix from a random sample of its entries. Here, we investigate the case of highly quantized observations when the measurements can take only a small number of values. These quantized outputs… ▽ More

    Submitted 26 August, 2014; originally announced August 2014.

  41. arXiv:1304.4549  [pdf, ps, other

    stat.ML

    Learning Heteroscedastic Models by Convex Programming under Group Sparsity

    Authors: Arnak S. Dalalyan, Mohamed Hebiri, Katia Méziani, Joseph Salmon

    Abstract: Popular sparse estimation methods based on $\ell_1$-relaxation, such as the Lasso and the Dantzig selector, require the knowledge of the variance of the noise in order to properly tune the regularization parameter. This constitutes a major obstacle in applying these methods in several frameworks---such as time series, random fields, inverse problems---for which the noise is rarely homoscedastic an… ▽ More

    Submitted 16 April, 2013; originally announced April 2013.

    Comments: Proceedings of the 30 th International Conference on Machine Learning (2013) https://icml.cc/2013/?page_id=43

  42. arXiv:1206.0338  [pdf, other

    cs.CV cs.LG stat.CO

    Poisson noise reduction with non-local PCA

    Authors: Joseph Salmon, Zachary Harmany, Charles-Alban Deledalle, Rebecca Willett

    Abstract: Photon-limited imaging arises when the number of photons collected by a sensor array is small relative to the number of detector elements. Photon limitations are an important concern for many applications such as spectral imaging, night vision, nuclear medicine, and astronomy. Typically a Poisson distribution is used to model these observations, and the inherent heteroscedasticity of the data comb… ▽ More

    Submitted 28 April, 2014; v1 submitted 1 June, 2012; originally announced June 2012.

    Comments: erratum: Image man is wrongly name pepper in the journal version