Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
157 tokens/sec
GPT-4o
43 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fisher information dissipation for time inhomogeneous stochastic differential equations (2402.01036v1)

Published 1 Feb 2024 in math.PR, cs.LG, and stat.ML

Abstract: We provide a Lyapunov convergence analysis for time-inhomogeneous variable coefficient stochastic differential equations (SDEs). Three typical examples include overdamped, irreversible drift, and underdamped Langevin dynamics. We first formula the probability transition equation of Langevin dynamics as a modified gradient flow of the Kullback-Leibler divergence in the probability space with respect to time-dependent optimal transport metrics. This formulation contains both gradient and non-gradient directions depending on a class of time-dependent target distribution. We then select a time-dependent relative Fisher information functional as a Lyapunov functional. We develop a time-dependent Hessian matrix condition, which guarantees the convergence of the probability density function of the SDE. We verify the proposed conditions for several time-inhomogeneous Langevin dynamics. For the overdamped Langevin dynamics, we prove the $O(t{-1/2})$ convergence in $L1$ distance for the simulated annealing dynamics with a strongly convex potential function. For the irreversible drift Langevin dynamics, we prove an improved convergence towards the target distribution in an asymptotic regime. We also verify the convergence condition for the underdamped Langevin dynamics. Numerical examples demonstrate the convergence results for the time-dependent Langevin dynamics.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (32)
  1. First-order optimization algorithms via inertial systems with Hessian driven damping. Mathematical Programming, 1–43, 2020.
  2. Convergence of iterates for first-order optimization algorithms with inertia and Hessian driven damping. Optimization, 1–40, 2021.
  3. E. Bayraktar, Q. Feng and W. Li. Exponential Entropy dissipation for weakly self-consistent Vlasov-Fokker-Planck equations. Journal of Nonlinear science, 2024. (To appear).
  4. Long-time behaviour of degenerate diffusions: UFG-type SDEs and time-inhomogeneous hypoelliptic. Electron. J. Probab. 26 (2021), article no. 22, 1–72.
  5. Hypoelliptic non-homogenous diffusions Probab. Theory Relat. Fields. 123, 453–483 (2002).
  6. V. Cerny. Thermodynamical approach to the traveling salesman problem: an efficient simulation algorithm. J. Optim. Theory Appl., 45(1):41–51, 1985.
  7. Diffusion for global optimization in 𝐑nsuperscript𝐑𝑛{\bf R}^{n}bold_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. SIAM J. Control Optim., 25(3):737–753, 1987.
  8. L. Chizat. Mean-Field Langevin Dynamics: Exponential Convergence and Annealing. Transactions on Machine Learning Research. 2835-8856, 2022.
  9. Variance Reduction Using Nonreversible Langevin Samplers. Journal of Statistical Physics, 163, 457–491, 2016.
  10. Nonreversible Langevin Samplers: Splitting Schemes, Analysis and Implementation. arXiv:1701.04247, 2017.
  11. An improved annealing method and its large-time behavior. Stochastic Process. Appl., 71(1):55–74, 1997.
  12. Q. Feng and W. Li. Entropy Dissipation for Degenerate Stochastic Differential Equations via Sub-Riemannian Density Manifold. Entropy, 25, 786, 2023.
  13. Q. Feng and W. Li. Hypoelliptic Entropy dissipation for stochastic differential equations. Preprint, arXiv:2102.00544, 2021.
  14. State-dependent temperature control for Langevin diffusions. arXiv:2005.04507, 2020.
  15. S. Geman and C.R. Hwang. Diffusions for global optimization. SIAM J. Control Optim., 24(5):1031–1043, 1986.
  16. L. Hörmander. Hypoelliptic second order differential equations. Acta Math. 119: 147-171 (1967).
  17. Strongly degenerate time inhomogeneous SDEs: Densities and support properties. Application to Hodgkin–Huxley type systems Bernoulli 23(4A), 2017, 2587–2616.
  18. Optimization by simulated annealing. Science, 220(4598):671–680, 1983.
  19. Beyond Log-concavity: Provable Guarantees for Sampling Multi-modal Distributions using Simulated Tempering Langevin Monte Carlo. In Advances in Neural Information Processing Systems (NeurIPS), 2018.
  20. Is there an analog of Nesterov acceleration for gradient-based MCMC? Bernoulli, 27 (3), 1942-1992, 2021.
  21. O. Mangoubi and N. K. Vishnoi. Convex Optimization with Unbounded Nonconvex Oracles using Simulated Annealing. In Proc. of Conference on Learning Theory (COLT), 2018.
  22. Simulated Tempering: A New Monte Carlo Scheme. Europhysics Letters (EPL), 19(6):451–458, 1992.
  23. P. Monmarché. Hypocoercivity in metastable settings and kinetic simulated annealing. Probability Theory and Related Fields, pages 1–34, 2018.
  24. Ergodicity of the infinite swapping algorithm at low temperature. 2018. arXiv:1811.10174.
  25. Applications of the Malliavin calculus, Part I. In North-Holland Mathematical Library, vol. 32, pp. 271-306. Elsevier, 1984.
  26. Simulated annealing from continuum to discretization: a convergence analysis via the Eyring-Kramers law. Preprint, arXiv:2102.02339, 2021.
  27. P. J. M. van Laarhoven and E. H. L. Aarts. Simulated annealing: theory and applications, volume 37 of Mathematics and its Applications. D. Reidel Publishing Co., 1987.
  28. Hypoellipticity Theorems and Conditional Laws. Z. Wahrscheinlichkeitstheorie verw. Gebiete, 65, 573–597 (1984).
  29. C. Villani. Hypocoercivity, Memoirs of the American Mathematical Society, 2009.
  30. C. Villani. Optimal Transport: Old and New, 2009.
  31. Geometry-informed irreversible perturbations for accelerated convergence of Langevin dynamics. Stat Comput, 32, 78, 2022.
  32. Primal-Dual damping algorithms for optimization. Annals of Mathematical Sciences and Applications, 2024. (To appear).
Citations (2)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com