
Unbalanced minibatch Optimal Transport; applications to Domain Adaptation (2103.03606v1)

Published 5 Mar 2021 in cs.LG, math.ST, stat.ML, and stat.TH

Abstract: Optimal transport distances have found many applications in machine learning for their capacity to compare non-parametric probability distributions. Yet their algorithmic complexity generally prevents their direct use on large-scale datasets. Among the possible strategies to alleviate this issue, practitioners can rely on computing estimates of these distances over subsets of data, i.e., minibatches. While computationally appealing, we highlight in this paper some limits of this strategy, arguing it can lead to undesirable smoothing effects. As an alternative, we suggest that the same minibatch strategy coupled with unbalanced optimal transport can yield more robust behavior. We discuss the associated theoretical properties, such as unbiased estimators, existence of gradients and concentration bounds. Our experimental study shows that in challenging problems associated with domain adaptation, the use of unbalanced optimal transport leads to significantly better results, competing with or surpassing recent baselines.
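
To make the minibatch strategy mentioned in the abstract concrete, here is a minimal sketch, not the authors' implementation. It averages a balanced entropic (Sinkhorn) transport cost over random pairs of minibatches. The function names, the squared Euclidean ground cost, the uniform weights, and all default parameters are assumptions chosen for illustration. The unbalanced variant the paper advocates would relax the marginal constraints and is not shown here.

```python
import numpy as np


def sinkhorn_cost(x, y, reg=0.1, n_iters=200):
    """Transport cost of an entropic-regularised (balanced) OT plan
    between two point clouds with uniform weights. Illustrative only."""
    a = np.full(len(x), 1.0 / len(x))  # uniform source weights (assumption)
    b = np.full(len(y), 1.0 / len(y))  # uniform target weights (assumption)
    # Squared Euclidean ground cost between every pair of points.
    C = ((x[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)
    # Scale the regularisation by the largest cost to avoid underflow in exp.
    scale = C.max() or 1.0
    K = np.exp(-C / (reg * scale))
    u = np.ones_like(a)
    for _ in range(n_iters):
        # Alternate projections onto the two marginal constraints.
        v = b / (K.T @ u)
        u = a / (K @ v)
    P = u[:, None] * K * v[None, :]  # approximate transport plan
    return float((P * C).sum())


def minibatch_ot(x, y, batch_size=16, n_batches=20, seed=0):
    """Average the OT cost over randomly drawn pairs of minibatches."""
    rng = np.random.default_rng(seed)
    costs = []
    for _ in range(n_batches):
        i = rng.choice(len(x), size=batch_size, replace=False)
        j = rng.choice(len(y), size=batch_size, replace=False)
        costs.append(sinkhorn_cost(x[i], y[j]))
    return float(np.mean(costs))
```

Note that the minibatch estimate between a dataset and itself is generally not zero, because the two batches are drawn independently. This is one simple way to see that minibatch estimates differ from the full optimal transport distance.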

Authors (4)
  1. Kilian Fatras (18 papers)
  2. Thibault Séjourné (8 papers)
  3. Nicolas Courty (55 papers)
  4. Rémi Flamary (70 papers)
Citations (134)
