Stabilized Sparse Scaling Algorithms for Entropy Regularized Transport Problems
The paper "Stabilized Sparse Scaling Algorithms for Entropy Regularized Transport Problems" by Bernhard Schmitzer addresses significant computational challenges inherent in solving optimal transport (OT) problems via entropy regularization. In particular, it focuses on scaling algorithms, such as the Sinkhorn iterations used to compute Wasserstein distances, barycenters, and gradient flows, which become numerically unstable and converge slowly as the entropy regularization parameter approaches zero.
Key Contributions
- Log-domain Stabilization: The paper introduces a log-domain stabilized formulation of scaling algorithms. In standard implementations the scaling factors diverge as the regularization shrinks, leading to numerical overflow. By carrying out the computations in the log-domain, the stabilized formulation accommodates much smaller regularization parameters while remaining numerically robust.
- ε-scaling Heuristic: ε-scaling, a technique well established for auction algorithms, accelerates convergence by gradually decreasing the regularization parameter during optimization. The paper adapts this heuristic to scaling algorithms, mitigating their slow convergence at small regularization.
- Adaptive Truncation of the Kernel: Memory use and runtime are significantly reduced by adaptively truncating the dense kernel matrix, which otherwise becomes unmanageable for large-scale problems. The paper presents a method to compute this truncated kernel with negligible, controllable truncation error, making large problem instances tractable.
- Multi-scale Scheme: A coarse-to-fine computational approach that integrates naturally with the strategies above, allowing larger problems to be solved to higher precision.
- New Convergence Analysis: The paper compares the Sinkhorn algorithm with the auction algorithm and offers a new convergence analysis aimed at explaining the efficiency gains from ε-scaling. This analysis guides refinements that keep the algorithm practical even for very small regularizations.
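To make the log-domain stabilization concrete, here is a minimal NumPy sketch of a Sinkhorn iteration carried out on the dual potentials rather than the scaling factors. The names (`sinkhorn_log`, `logsumexp`) are my own, and this is an illustration of the general idea, not the paper's actual implementation, which absorbs the scaling factors into the kernel only when they threaten to overflow.

```python
import numpy as np

def logsumexp(M, axis):
    """Numerically stable log-sum-exp along an axis (max-shift trick)."""
    m = M.max(axis=axis, keepdims=True)
    return (m + np.log(np.exp(M - m).sum(axis=axis, keepdims=True))).squeeze(axis)

def sinkhorn_log(C, a, b, eps, n_iter=1000):
    """Log-domain Sinkhorn for marginals a, b and cost matrix C.

    Iterates on the dual potentials f, g instead of the scaling factors
    u = exp(f/eps), v = exp(g/eps), which would overflow for small eps.
    """
    f, g = np.zeros_like(a), np.zeros_like(b)
    for _ in range(n_iter):
        f = eps * np.log(a) - eps * logsumexp((g[None, :] - C) / eps, axis=1)
        g = eps * np.log(b) - eps * logsumexp((f[:, None] - C) / eps, axis=0)
    # Recover the (dense) transport plan from the converged potentials.
    return np.exp((f[:, None] + g[None, :] - C) / eps)
```

At convergence the returned plan has row sums a and column sums b; the column constraint is satisfied exactly after each g-update by construction.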
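The ε-scaling heuristic can be sketched as an outer loop around such an iteration: solve a sequence of problems with geometrically decreasing ε, warm-starting the dual potentials at each stage with the values from the previous, larger ε. The function name, the simple geometric schedule, and the fixed iteration budget per stage below are my own illustrative assumptions; the paper's schedule and stopping criteria differ.

```python
import numpy as np

def lse(M, axis):
    """Numerically stable log-sum-exp along an axis."""
    m = M.max(axis=axis, keepdims=True)
    return (m + np.log(np.exp(M - m).sum(axis=axis, keepdims=True))).squeeze(axis)

def sinkhorn_eps_scaling(C, a, b, eps_target, factor=0.5, iters_per_eps=200):
    """Warm-started eps-scaling (illustrative sketch).

    Runs log-domain Sinkhorn for eps = eps_0 * factor^k down to eps_target,
    reusing the dual potentials f, g between stages so that each smaller-eps
    problem starts close to its solution.
    """
    f, g = np.zeros_like(a), np.zeros_like(b)
    eps = max(C.max(), eps_target)  # start with a large, easy regularization
    while True:
        for _ in range(iters_per_eps):
            f = eps * np.log(a) - eps * lse((g[None, :] - C) / eps, axis=1)
            g = eps * np.log(b) - eps * lse((f[:, None] - C) / eps, axis=0)
        if eps <= eps_target:
            break
        eps = max(eps * factor, eps_target)
    return np.exp((f[:, None] + g[None, :] - C) / eps)
```

Compared with running Sinkhorn directly at the target ε, the warm starts typically cut the total iteration count substantially, which is exactly the effect the paper's convergence analysis seeks to explain.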
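Adaptive truncation can likewise be sketched: given approximate dual potentials, the stabilized kernel entries exp((f_i + g_j - c_ij)/eps) decay very fast away from the support of the optimal plan, so entries below a threshold theta can be set to exact zero. The function name and the dense boolean mask below are purely illustrative; a practical implementation would store the result as a sparse matrix and choose theta to bound the truncation error.

```python
import numpy as np

def truncated_kernel(C, f, g, eps, theta=1e-10):
    """Truncated stabilized kernel (illustrative sketch).

    Keeps only entries whose stabilized value exp((f_i + g_j - c_ij)/eps)
    is at least theta; all other entries are treated as exact zeros.
    """
    logK = (f[:, None] + g[None, :] - C) / eps
    return np.where(logK >= np.log(theta), np.exp(logK), 0.0)
```

For small eps the surviving entries concentrate near the optimal assignment, so the number of stored entries grows roughly linearly in the problem size rather than quadratically.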
Implications and Speculation on Future Developments
Practically, these advancements allow larger entropy-regularized transport problems to be solved with better precision and less computational overhead. This has immediate applications in areas such as image processing, computer vision, and machine learning, where optimal transport is used to compare and interpolate distributions of data.
Theoretically, the work marks a significant step toward understanding the limits and behavior of entropy-regularized transport problems as the regularization vanishes. It opens further avenues for research, particularly into the full theoretical underpinnings of ε-scaling within the context of the Sinkhorn algorithm and broader transport paradigms.
As the applications of artificial intelligence continue to expand, the need for robust, efficient computational tools like those proposed in Schmitzer's work becomes ever more critical. Natural next steps include extending the concepts to multi-marginal problems, exploring parallels in other domains such as physics and economics where similar optimization frameworks arise, and integrating these analytical tools more tightly into scalable AI systems so that they remain adaptable to a wide range of computational requirements.
In conclusion, Bernhard Schmitzer's work not only advances the practical utility of scaling algorithms for entropy regularized transport problems but also enriches the theoretical framework within which such algorithms operate. These contributions represent a significant step forward in both the understanding and application of optimal transport theory in computational sciences.