Amortized Variational Inference: When and Why? (2307.11018v4)

Published 20 Jul 2023 in stat.ML and cs.LG

Abstract: In a probabilistic latent variable model, factorized (or mean-field) variational inference (F-VI) fits a separate parametric distribution for each latent variable. Amortized variational inference (A-VI) instead learns a common inference function, which maps each observation to its corresponding latent variable's approximate posterior. Typically, A-VI is used as a step in the training of variational autoencoders, however it stands to reason that A-VI could also be used as a general alternative to F-VI. In this paper we study when and why A-VI can be used for approximate Bayesian inference. We derive conditions on a latent variable model which are necessary, sufficient, and verifiable under which A-VI can attain F-VI's optimal solution, thereby closing the amortization gap. We prove these conditions are uniquely verified by simple hierarchical models, a broad class that encompasses many models in machine learning. We then show, on a broader class of models, how to expand the domain of AVI's inference function to improve its solution, and we provide examples, e.g. hidden Markov models, where the amortization gap cannot be closed.

Citations (5)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

GitHub - charlesm93/AVI-when-and-why: Code to reproduce experiments in paper: "Amortized Variational Inference: When and Why?" (6 stars)

Tweets

https://twitter.com/charlesm993/status/1783988576952123766

Amortized Variational Inference: When and Why? (2307.11018v4)

Summary

Related Papers

GitHub

Tweets