Efficient Semi-Implicit Variational Inference (2101.06070v1)

Published 15 Jan 2021 in cs.LG

Abstract: In this paper, we propose CI-VI an efficient and scalable solver for semi-implicit variational inference (SIVI). Our method, first, maps SIVI's evidence lower bound (ELBO) to a form involving a nonlinear functional nesting of expected values and then develops a rigorous optimiser capable of correctly handling bias inherent to nonlinear nested expectations using an extrapolation-smoothing mechanism coupled with gradient sketching. Our theoretical results demonstrate convergence to a stationary point of the ELBO in general non-convex settings typically arising when using deep network models and an order of $O(t^{{-\frac{4}{5}})$} gradient-bias-vanishing rate. We believe these results generalise beyond the specific nesting arising from SIVI to other forms. Finally, in a set of experiments, we demonstrate the effectiveness of our algorithm in approximating complex posteriors on various data-sets including those from natural language processing.

Citations (6)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Related Papers

Semi-Implicit Variational Inference via Score Matching (2023)
Doubly Semi-Implicit Variational Inference (2018)
Semi-Implicit Variational Inference (2018)
Kernel Semi-Implicit Variational Inference (2024)
Particle Semi-Implicit Variational Inference (2024)