Showing 1–39 of 39 results for author: Peyré, G

  1. arXiv:2408.01367

    cs.CL stat.ML

    Transformers are Universal In-context Learners

    Authors: Takashi Furuya, Maarten V. de Hoop, Gabriel Peyré

    Abstract: Transformers are deep architectures that define "in-context mappings" which enable predicting new tokens based on a given set of tokens (such as a prompt in NLP applications or a set of patches for vision transformers). This work studies in particular the ability of these architectures to handle an arbitrarily large number of context tokens. To mathematically and uniformly address the expressivity…

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 16 pages

  2. arXiv:2402.05787

    stat.ML cs.LG

    How do Transformers perform In-Context Autoregressive Learning?

    Authors: Michael E. Sander, Raja Giryes, Taiji Suzuki, Mathieu Blondel, Gabriel Peyré

    Abstract: Transformers have achieved state-of-the-art performance in language modeling tasks. However, the reasons behind their tremendous success are still unclear. In this paper, towards a better understanding, we train a Transformer model on a simple next token prediction task, where sequences are generated as a first-order autoregressive process $s_{t+1} = W s_t$. We show how a trained Transformer predi…

    Submitted 5 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 20 pages, ICML 2024

  3. arXiv:2311.05788

    cs.LG math.OC stat.ML

    Structured Transforms Across Spaces with Cost-Regularized Optimal Transport

    Authors: Othmane Sebbouh, Marco Cuturi, Gabriel Peyré

    Abstract: Matching a source to a target probability measure is often solved by instantiating a linear optimal transport (OT) problem, parameterized by a ground cost function that quantifies discrepancy between points. When these measures live in the same metric space, the ground cost often defaults to its distance. When instantiated across two different spaces, however, choosing that cost in the absence of…

    Submitted 23 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

  4. arXiv:2305.15042

    cs.LG stat.ML

    Test like you Train in Implicit Deep Learning

    Authors: Zaccharie Ramzi, Pierre Ablin, Gabriel Peyré, Thomas Moreau

    Abstract: Implicit deep learning has recently gained popularity with applications ranging from meta-learning to Deep Equilibrium Networks (DEQs). In its general formulation, it relies on expressing some components of deep learning pipelines implicitly, typically via a root equation called the inner problem. In practice, the solution of the inner problem is approximated during training with an iterative proc…

    Submitted 24 May, 2023; originally announced May 2023.

  5. arXiv:2302.01425

    cs.LG stat.ML

    Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective

    Authors: Michael E. Sander, Joan Puigcerver, Josip Djolonga, Gabriel Peyré, Mathieu Blondel

    Abstract: The top-k operator returns a sparse vector, where the non-zero values correspond to the k largest values of the input. Unfortunately, because it is a discontinuous function, it is difficult to incorporate in neural networks trained end-to-end with backpropagation. Recent works have considered differentiable relaxations, based either on regularization or perturbation techniques. However, to date, n…

    Submitted 4 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: ICML 2023, 18 pages

  6. arXiv:2211.08775

    stat.ML cs.LG math.OC

    Unbalanced Optimal Transport, from Theory to Numerics

    Authors: Thibault Séjourné, Gabriel Peyré, François-Xavier Vialard

    Abstract: Optimal Transport (OT) has recently emerged as a central tool in data sciences to compare in a geometrically faithful way point clouds and more generally probability distributions. The wide adoption of OT into existing data analysis and machine learning pipelines is however plagued by several shortcomings. This includes its lack of robustness to outliers, its high computational costs, the need for…

    Submitted 16 January, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

  7. arXiv:2205.14612

    cs.LG stat.ML

    Do Residual Neural Networks discretize Neural Ordinary Differential Equations?

    Authors: Michael E. Sander, Pierre Ablin, Gabriel Peyré

    Abstract: Neural Ordinary Differential Equations (Neural ODEs) are the continuous analog of Residual Neural Networks (ResNets). We investigate whether the discrete dynamics defined by a ResNet are close to the continuous one of a Neural ODE. We first quantify the distance between the ResNet's hidden state trajectory and the solution of its corresponding Neural ODE. Our bound is tight and, on the negative si…

    Submitted 15 September, 2022; v1 submitted 29 May, 2022; originally announced May 2022.

    Comments: Accepted at NeurIPS 2022, 24 pages

  8. arXiv:2205.01385

    math.OC cs.LG stat.ML

    Smooth over-parameterized solvers for non-smooth structured optimization

    Authors: Clarice Poon, Gabriel Peyré

    Abstract: Non-smooth optimization is a core ingredient of many imaging or machine learning pipelines. Non-smoothness encodes structural constraints on the solutions, such as sparsity, group sparsity, low-rank and sharp edges. It is also the basis for the definition of robust loss functions and scale-free functionals such as square-root Lasso. Standard approaches to deal with non-smoothness leverage either p…

    Submitted 3 May, 2022; originally announced May 2022.

  9. arXiv:2111.13162

    cs.LG math.OC stat.ML

    Randomized Stochastic Gradient Descent Ascent

    Authors: Othmane Sebbouh, Marco Cuturi, Gabriel Peyré

    Abstract: An increasing number of machine learning problems, such as robust or adversarial variants of existing algorithms, require minimizing a loss function that is itself defined as a maximum. Carrying a loop of stochastic gradient ascent (SGA) steps on the (inner) maximization problem, followed by an SGD step on the (outer) minimization, is known as Epoch Stochastic Gradient Descent Ascent (ESG…

    Submitted 25 November, 2021; originally announced November 2021.

  10. arXiv:2110.11773

    cs.LG stat.ML

    Sinkformers: Transformers with Doubly Stochastic Attention

    Authors: Michael E. Sander, Pierre Ablin, Mathieu Blondel, Gabriel Peyré

    Abstract: Attention based models such as Transformers involve pairwise interactions between data points, modeled with a learnable attention matrix. Importantly, this attention matrix is normalized with the SoftMax operator, which makes it row-wise stochastic. In this paper, we propose instead to use Sinkhorn's algorithm to make attention matrices doubly stochastic. We call the resulting model a Sinkformer.…

    Submitted 24 January, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: Accepted at AISTATS

  11. arXiv:2106.01429

    stat.ML cs.LG math.OC

    Smooth Bilevel Programming for Sparse Regularization

    Authors: Clarice Poon, Gabriel Peyré

    Abstract: Iteratively reweighted least square (IRLS) is a popular approach to solve sparsity-enforcing regression problems in machine learning. State of the art approaches are more efficient but typically rely on specific coordinate pruning schemes. In this work, we show how a surprisingly simple reparametrization of IRLS, coupled with a bilevel resolution (instead of an alternating scheme) is able to achie…

    Submitted 30 September, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Journal ref: Proc. NeurIPS 2021

  12. arXiv:2106.01128

    cs.LG stat.ML

    Linear-Time Gromov Wasserstein Distances using Low Rank Couplings and Costs

    Authors: Meyer Scetbon, Gabriel Peyré, Marco Cuturi

    Abstract: The ability to align points across two related yet incomparable point clouds (e.g. living in different spaces) plays an important role in machine learning. The Gromov-Wasserstein (GW) framework provides an increasingly popular answer to such problems, by seeking a low-distortion, geometry-preserving assignment between these points. As a non-convex, quadratic generalization of optimal transport (OT…

    Submitted 6 February, 2023; v1 submitted 2 June, 2021; originally announced June 2021.

  13. arXiv:2103.04737

    stat.ML cs.LG

    Low-Rank Sinkhorn Factorization

    Authors: Meyer Scetbon, Marco Cuturi, Gabriel Peyré

    Abstract: Several recent applications of optimal transport (OT) theory to machine learning have relied on regularization, notably entropy and the Sinkhorn algorithm. Because matrix-vector products are pervasive in the Sinkhorn algorithm, several works have proposed to approximate kernel matrices appearing in its iterations using low-rank factors. Another route lies instead in imposing low-rank cons…

    Submitted 8 March, 2021; originally announced March 2021.

  14. arXiv:2102.07870

    cs.LG cs.AI stat.ML

    Momentum Residual Neural Networks

    Authors: Michael E. Sander, Pierre Ablin, Mathieu Blondel, Gabriel Peyré

    Abstract: The training of deep residual neural networks (ResNets) with backpropagation has a memory cost that increases linearly with respect to the depth of the network. A way to circumvent this issue is to use reversible architectures. In this paper, we propose to change the forward rule of a ResNet by adding a momentum term. The resulting networks, momentum residual neural networks (Momentum ResNets), ar…

    Submitted 22 July, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: 24 pages

  15. arXiv:2102.07432

    stat.ML cs.LG stat.CO

    Fast and accurate optimization on the orthogonal manifold without retraction

    Authors: Pierre Ablin, Gabriel Peyré

    Abstract: We consider the problem of minimizing a function over the manifold of orthogonal matrices. The majority of algorithms for this problem compute a direction in the tangent space, and then use a retraction to move in that direction while staying on the manifold. Unfortunately, the numerical computation of retractions on the orthogonal manifold always involves some expensive linear algebra operation,…

    Submitted 31 January, 2022; v1 submitted 15 February, 2021; originally announced February 2021.

  16. arXiv:2102.06278

    stat.ML cs.LG

    Unsupervised Ground Metric Learning using Wasserstein Singular Vectors

    Authors: Geert-Jan Huizing, Laura Cantini, Gabriel Peyré

    Abstract: Defining meaningful distances between samples in a dataset is a fundamental problem in machine learning. Optimal Transport (OT) lifts a distance between features (the "ground metric") to a geometrically meaningful distance between samples. However, there is usually no straightforward choice of ground metric. Supervised ground metric learning approaches exist but require labeled data. In absence of…

    Submitted 19 July, 2022; v1 submitted 11 February, 2021; originally announced February 2021.

  17. arXiv:2009.04266

    math.OC stat.ML

    The Unbalanced Gromov Wasserstein Distance: Conic Formulation and Relaxation

    Authors: Thibault Séjourné, François-Xavier Vialard, Gabriel Peyré

    Abstract: Comparing metric measure spaces (i.e. a metric space endowed with a probability distribution) is at the heart of many machine learning problems. The most popular distance between such metric measure spaces is the Gromov-Wasserstein (GW) distance, which is the solution of a quadratic assignment problem. The GW distance is however limited to the comparison of metric measure spaces endowed with a proba…

    Submitted 16 January, 2023; v1 submitted 9 September, 2020; originally announced September 2020.

  18. arXiv:2006.13708

    stat.ML cs.LG

    Distribution-Based Invariant Deep Networks for Learning Meta-Features

    Authors: Gwendoline De Bie, Herilalaina Rakotoarison, Gabriel Peyré, Michèle Sebag

    Abstract: Recent advances in deep learning from probability distributions successfully achieve classification or regression from distribution samples, thus invariant under permutation of the samples. The first contribution of the paper is to extend these neural architectures to achieve invariance under permutation of the features, too. The proposed architecture, called Dida, inherits the NN properties of un…

    Submitted 18 October, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

  19. arXiv:2006.08172

    math.OC math.ST stat.ML

    Faster Wasserstein Distance Estimation with the Sinkhorn Divergence

    Authors: Lenaic Chizat, Pierre Roussillon, Flavien Léger, François-Xavier Vialard, Gabriel Peyré

    Abstract: The squared Wasserstein distance is a natural quantity to compare probability distributions in a non-parametric setting. This quantity is usually estimated with the plug-in estimator, defined via a discrete optimal transport problem which can be solved to $ε$-accuracy by adding an entropic regularization of order $ε$ and using for instance Sinkhorn's algorithm. In this work, we propose instead to…

    Submitted 29 October, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

    Journal ref: Neural Information Processing Systems, Dec 2020, Vancouver, Canada

  20. arXiv:2003.01415

    math.OC stat.ML

    Online Sinkhorn: Optimal Transport distances from sample streams

    Authors: Arthur Mensch, Gabriel Peyré

    Abstract: Optimal Transport (OT) distances are now routinely used as loss functions in ML tasks. Yet, computing OT distances between arbitrary (i.e. not necessarily discrete) probability distributions remains an open problem. This paper introduces a new online estimator of entropy-regularized OT distances between two such arbitrary distributions. It uses streams of samples from both distributions to iterati…

    Submitted 2 July, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

  21. arXiv:2002.03722

    stat.ML cs.LG

    Super-efficiency of automatic differentiation for functions defined as a minimum

    Authors: Pierre Ablin, Gabriel Peyré, Thomas Moreau

    Abstract: In min-min optimization or max-min optimization, one has to compute the gradient of a function defined as a minimum. In most cases, the minimum has no closed-form, and an approximation is obtained via an iterative algorithm. There are two usual ways of estimating the gradient of the function: using either an analytic formula obtained by assuming exactness of the approximation, or automatic differe…

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 31 pages

  22. arXiv:1911.03577

    stat.ML cs.LG math.ST

    Degrees of freedom for off-the-grid sparse estimation

    Authors: Clarice Poon, Gabriel Peyré

    Abstract: A central question in modern machine learning and imaging sciences is to quantify the number of effective parameters of vastly over-parameterized models. The degrees of freedom is a mathematically convenient way to define this number of parameters. Its computation and properties are well understood when dealing with discretized linear models, possibly regularized using sparsity. In this paper, we…

    Submitted 8 November, 2019; originally announced November 2019.

  23. arXiv:1911.03117

    stat.ML cs.GR cs.LG math.OC

    Ground Metric Learning on Graphs

    Authors: Matthieu Heitz, Nicolas Bonneel, David Coeurjolly, Marco Cuturi, Gabriel Peyré

    Abstract: Optimal transport (OT) distances between probability distributions are parameterized by the ground metric they use between observations. Their relevance for real-life applications strongly hinges on whether that ground metric parameter is suitably chosen. Selecting it adaptively and algorithmically from prior knowledge, the so-called ground metric learning (GML) problem, has therefore appeared in v…

    Submitted 5 November, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: Fixed sign of gradient

    Journal ref: Journal of Mathematical Imaging and Vision (2020): 1-19

  24. arXiv:1910.12958

    math.OC cs.LG stat.ML

    Sinkhorn Divergences for Unbalanced Optimal Transport

    Authors: Thibault Séjourné, Jean Feydy, François-Xavier Vialard, Alain Trouvé, Gabriel Peyré

    Abstract: Optimal transport induces the Earth Mover's (Wasserstein) distance between probability distributions, a geometric divergence that is relevant to a wide range of problems. Over the last decade, two relaxations of optimal transport have been studied in depth: unbalanced transport, which is robust to the presence of outliers and can be used when distributions don't have the same total mass; entropy-r…

    Submitted 16 January, 2023; v1 submitted 28 October, 2019; originally announced October 2019.

  25. arXiv:1905.06005

    stat.ML cs.LG math.OC

    Geometric Losses for Distributional Learning

    Authors: Arthur Mensch, Mathieu Blondel, Gabriel Peyré

    Abstract: Building upon recent advances in entropy-regularized optimal transport, and upon Fenchel duality between measures and continuous functions, we propose a generalization of the logistic loss that incorporates a metric or cost between classes. Unlike previous attempts to use optimal transport distances for learning, our loss results in unconstrained convex objective functions, supports infinite (or…

    Submitted 15 May, 2019; originally announced May 2019.

    Journal ref: Proceedings of the International Conference on Machine Learning, 2019, Long Beach, United States

  26. arXiv:1905.04943

    cs.LG stat.ML

    Universal Invariant and Equivariant Graph Neural Networks

    Authors: Nicolas Keriven, Gabriel Peyré

    Abstract: Graph Neural Networks (GNN) come in many flavors, but should always be either invariant (permutation of the nodes of the input graph does not affect the output) or equivariant (permutation of the input permutes the output). In this paper, we consider a specific class of invariant and equivariant networks, for which we prove new universality theorems. More precisely, we consider networks with a sin…

    Submitted 24 October, 2019; v1 submitted 13 May, 2019; originally announced May 2019.

  27. arXiv:1811.07429

    stat.ML cs.LG

    Stochastic Deep Networks

    Authors: Gwendoline de Bie, Gabriel Peyré, Marco Cuturi

    Abstract: Machine learning is increasingly targeting areas where input data cannot be accurately described by a single vector, but can be modeled instead using the more flexible concept of random vectors, namely probability measures or more simply point clouds of varying cardinality. Using deep architectures on measures poses, however, many challenging issues. Indeed, deep architectures are originally desig…

    Submitted 20 February, 2019; v1 submitted 18 November, 2018; originally announced November 2018.

  28. arXiv:1811.05527

    cs.LG cs.AI math.OC stat.ML

    Semi-dual Regularized Optimal Transport

    Authors: Marco Cuturi, Gabriel Peyré

    Abstract: Variational problems that involve Wasserstein distances and more generally optimal transport (OT) theory are playing an increasingly important role in data sciences. Such problems can be used to form an exemplar measure out of various probability measures, as in the Wasserstein barycenter problem, or to carry out parametric inference and density fitting, where the loss is measured in terms of an o…

    Submitted 13 November, 2018; originally announced November 2018.

    Journal ref: SIAM Review, 60(4), 941-965, 2018

  29. arXiv:1803.00567

    stat.ML

    Computational Optimal Transport

    Authors: Gabriel Peyré, Marco Cuturi

    Abstract: Optimal transport (OT) theory can be informally described using the words of the French mathematician Gaspard Monge (1746-1818): A worker with a shovel in hand has to move a large pile of sand lying on a construction site. The goal of the worker is to erect with all that sand a target pile with a prescribed shape (for example, that of a giant sand castle). Naturally, the worker wishes to minimize…

    Submitted 18 March, 2020; v1 submitted 1 March, 2018; originally announced March 2018.

    Comments: new version with corrected typos in Eq. 4.43 and 4.44 (minus sign in front of f, g now changed to +); a few more corrected typos

    Journal ref: Foundations and Trends in Machine Learning, vol. 11, no. 5-6, pp. 355-607, 2019

  30. arXiv:1708.01955

    stat.ML cs.GR math.OC

    Wasserstein Dictionary Learning: Optimal Transport-based unsupervised non-linear dictionary learning

    Authors: Morgan A. Schmitz, Matthieu Heitz, Nicolas Bonneel, Fred Maurice Ngolè Mboula, David Coeurjolly, Marco Cuturi, Gabriel Peyré, Jean-Luc Starck

    Abstract: This paper introduces a new nonlinear dictionary learning method for histograms in the probability simplex. The method leverages optimal transport theory, in the sense that our aim is to reconstruct histograms using so-called displacement interpolations (a.k.a. Wasserstein barycenters) between dictionary atoms; such atoms are themselves synthetic histograms in the probability simplex. Our method s…

    Submitted 15 March, 2018; v1 submitted 6 August, 2017; originally announced August 2017.

    Comments: Published in SIAM SIIMS. 46 pages, 24 figures

    Journal ref: SIAM Journal on Imaging Sciences 11(1) (2018) 643-678

  31. arXiv:1707.03194

    math.OC cs.CV stat.ML

    Sensitivity Analysis for Mirror-Stratifiable Convex Functions

    Authors: Jalal Fadili, Jérôme Malick, Gabriel Peyré

    Abstract: This paper provides a set of sensitivity analysis and activity identification results for a class of convex functions with a strong geometric structure, that we coined "mirror-stratifiable". These functions are such that there is a bijection between a primal and a dual stratification of the space into partitioning sets, called strata. This pairing is crucial to track the strata that are identifiab…

    Submitted 5 June, 2018; v1 submitted 11 July, 2017; originally announced July 2017.

    MSC Class: 65K05; 65K10; 90C25; 90C31

  32. arXiv:1706.01807

    stat.ML

    GAN and VAE from an Optimal Transport Point of View

    Authors: Aude Genevay, Gabriel Peyré, Marco Cuturi

    Abstract: This short article revisits some of the ideas introduced in arXiv:1701.07875 and arXiv:1705.07642 in a simple setup. This sheds some light on the connections between Variational Autoencoders (VAE), Generative Adversarial Networks (GAN) and Minimum Kantorovitch Estimators (MKE).

    Submitted 6 June, 2017; originally announced June 2017.

  33. arXiv:1706.00292

    stat.ML

    Learning Generative Models with Sinkhorn Divergences

    Authors: Aude Genevay, Gabriel Peyré, Marco Cuturi

    Abstract: The ability to compare two degenerate probability distributions (i.e. two probability distributions supported on two distinct low-dimensional manifolds living in a much higher-dimensional space) is a crucial problem arising in the estimation of generative models for high-dimensional observations such as those arising in computer vision or natural language. It is known that optimal transport metric…

    Submitted 20 October, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

  34. arXiv:1503.02533

    stat.ML math.OC

    A Smoothed Dual Approach for Variational Wasserstein Problems

    Authors: Marco Cuturi, Gabriel Peyré

    Abstract: Variational problems that involve Wasserstein distances have been recently proposed to summarize and learn from probability measures. Despite being conceptually simple, such problems are computationally challenging because they involve minimizing over quantities (Wasserstein distances) that are themselves hard to compute. We show that the dual formulation of Wasserstein variational problems introd…

    Submitted 24 August, 2015; v1 submitted 9 March, 2015; originally announced March 2015.

  35. arXiv:1407.1598

    math.OC cs.IT stat.ML

    Low Complexity Regularization of Linear Inverse Problems

    Authors: Samuel Vaiter, Gabriel Peyré, Jalal M. Fadili

    Abstract: Inverse problems and regularization theory is a central theme in contemporary signal processing, where the goal is to reconstruct an unknown signal from partial indirect, and possibly noisy, measurements of it. A now standard method for recovering the unknown signal is to solve a convex optimization problem that enforces some prior knowledge about its structure. This has proved efficient in many p…

    Submitted 8 December, 2014; v1 submitted 7 July, 2014; originally announced July 2014.

  36. arXiv:1405.1164

    math.ST stat.AP

    Stein Unbiased GrAdient estimator of the Risk (SUGAR) for multiple parameter selection

    Authors: Charles-Alban Deledalle, Samuel Vaiter, Jalal M. Fadili, Gabriel Peyré

    Abstract: Algorithms to solve variational regularization of ill-posed inverse problems usually involve operators that depend on a collection of continuous parameters. When these operators enjoy some (local) regularity, these parameters can be selected using the so-called Stein Unbiased Risk Estimate (SURE). While this selection is usually performed by exhaustive search, we address in this work the problem o…

    Submitted 9 August, 2014; v1 submitted 6 May, 2014; originally announced May 2014.

  37. arXiv:1405.1004

    math.OC cs.IT stat.ML

    Model Consistency of Partly Smooth Regularizers

    Authors: Samuel Vaiter, Gabriel Peyré, Jalal M. Fadili

    Abstract: This paper studies least-square regression penalized with partly smooth convex regularizers. This class of functions is very large and versatile allowing to promote solutions conforming to some notion of low-complexity. Indeed, they force solutions of variational problems to belong to a low-dimensional manifold (the so-called model) which is stable under small perturbations of the function. This p…

    Submitted 29 June, 2014; v1 submitted 5 May, 2014; originally announced May 2014.

  38. arXiv:1301.5874

    math.ST stat.AP

    Stein COnsistent Risk Estimator (SCORE) for hard thresholding

    Authors: Charles-Alban Deledalle, Gabriel Peyré, Jalal Fadili

    Abstract: In this work, we construct a risk estimator for hard thresholding which can be used as a basis to solve the difficult task of automatically selecting the threshold. As hard thresholding is not even continuous, Stein's lemma cannot be used to get an unbiased estimator of degrees of freedom, hence of the risk. We prove that under a mild condition, our estimator of the degrees of freedom, although bi…

    Submitted 24 January, 2013; originally announced January 2013.

  39. arXiv:1205.1482

    math.OC cs.IT cs.LG math.ST stat.ML

    Risk estimation for matrix recovery with spectral regularization

    Authors: Charles-Alban Deledalle, Samuel Vaiter, Gabriel Peyré, Jalal Fadili, Charles Dossal

    Abstract: In this paper, we develop an approach to recursively estimate the quadratic risk for matrix recovery problems regularized with spectral functions. Toward this end, in the spirit of the SURE theory, a key step is to compute the (weak) derivative and divergence of a solution with respect to the observations. As such a solution is not available in closed form, but rather through a proximal splitting…

    Submitted 1 November, 2012; v1 submitted 7 May, 2012; originally announced May 2012.

    Comments: This version is an update of our original paper presented at ICML'2012 workshop on Sparsity, Dictionaries and Projections in Machine Learning and Signal Processing