On the geometry of Stein variational gradient descent (1912.00894v2)

Published 2 Dec 2019 in stat.ML, cs.LG, math.AP, math.ST, and stat.TH

Abstract: Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely on iterated steepest descent steps with respect to a reproducing kernel Hilbert space norm. This construction leads to interacting particle systems, the mean-field limit of which is a gradient flow on the space of probability distributions equipped with a certain geometrical structure. We leverage this viewpoint to shed some light on the convergence properties of the algorithm, in particular addressing the problem of choosing a suitable positive definite kernel function. Our analysis leads us to considering certain nondifferentiable kernels with adjusted tails. We demonstrate significant performance gains of these in various numerical experiments.

Citations (89)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

On the geometry of Stein variational gradient descent (1912.00894v2)

Summary

Related Papers