Unbalanced Optimal Transport, from Theory to Numerics

Séjourné, Thibault; Peyré, Gabriel; Vialard, François-Xavier

arXiv:2211.08775 (stat)

[Submitted on 16 Nov 2022 (v1), last revised 16 Jan 2023 (this version, v2)]

Abstract: Optimal Transport (OT) has recently emerged as a central tool in data science for comparing point clouds, and more generally probability distributions, in a geometrically faithful way. The wide adoption of OT in existing data analysis and machine learning pipelines is, however, hampered by several shortcomings. These include its lack of robustness to outliers, its high computational cost, the need for a large number of samples in high dimension, and the difficulty of handling data in distinct spaces. In this review, we detail several recently proposed approaches to mitigate these issues. We focus in particular on unbalanced OT, which compares arbitrary positive measures, not restricted to probability distributions (i.e. their total mass can vary). This generalization of OT makes it robust to outliers and missing data. The second workhorse of modern computational OT is entropic regularization, which leads to scalable algorithms while lowering the sample complexity in high dimension. The last point presented in this review is the Gromov-Wasserstein (GW) distance, which extends OT to cope with distributions belonging to different metric spaces. The main motivation for this review is to explain how unbalanced OT, entropic regularization and GW can work hand-in-hand to turn OT into efficient geometric loss functions for data science.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2211.08775 [stat.ML]
	(or arXiv:2211.08775v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2211.08775

Submission history

From: Thibault Sejourne
[v1] Wed, 16 Nov 2022 09:02:52 UTC (6,596 KB)
[v2] Mon, 16 Jan 2023 13:58:10 UTC (6,614 KB)
