
Adjoint Monte Carlo Method (2401.08361v2)

Published 16 Jan 2024 in math.NA, cs.NA, and physics.comp-ph

Abstract: This survey explores the development of adjoint Monte Carlo methods for solving optimization problems governed by kinetic equations, a common challenge in areas such as plasma control and device design. These optimization problems are particularly demanding due to the high dimensionality of the phase space and the randomness in evaluating the objective functional, a consequence of using a forward Monte Carlo solver. To overcome these difficulties, a range of "adjoint Monte Carlo methods" have been devised. These methods skillfully combine Monte Carlo gradient estimators with PDE-constrained optimization, introducing innovative solutions tailored for kinetic applications. In this review, we begin by examining three primary strategies for Monte Carlo gradient estimation: the score function approach, the reparameterization trick, and the coupling method. We also delve into the adjoint-state method, an essential element in PDE-constrained optimization. Focusing on applications in the radiative transfer equation and the nonlinear Boltzmann equation, we provide a comprehensive guide on how to integrate Monte Carlo gradient techniques within both the optimize-then-discretize and the discretize-then-optimize frameworks from PDE-constrained optimization. This approach leads to the formulation of effective adjoint Monte Carlo methods, enabling efficient gradient estimation in complex, high-dimensional optimization problems.
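The three gradient estimation strategies named in the abstract can be illustrated on a toy problem that is not taken from the paper: estimating the derivative with respect to theta of E[x^2] for x drawn from N(theta, 1), whose exact value is 2*theta. The sketch below is a minimal illustration under that assumption. The function name `gradient_estimates` and its parameters are hypothetical, and the coupling method is interpreted here as a central finite difference that reuses the same random draws on both sides, which is one common reading and may differ from the paper's construction.

```python
import numpy as np

def gradient_estimates(theta, n_samples=200_000, h=1e-2, seed=0):
    """Toy estimators of d/dtheta E[x^2], x ~ N(theta, 1). Exact value: 2*theta."""
    rng = np.random.default_rng(seed)
    eps = rng.standard_normal(n_samples)  # base noise, independent of theta
    x = theta + eps                       # samples from N(theta, 1)

    def f(v):
        return v ** 2

    # Score function: E[f(x) * d/dtheta log p(x; theta)], with score (x - theta).
    score = np.mean(f(x) * (x - theta))

    # Reparameterization: differentiate through x = theta + eps, so df/dtheta = 2x.
    reparam = np.mean(2.0 * x)

    # Coupling (assumed form): central difference sharing the same eps on both sides.
    coupled = np.mean((f(theta + h + eps) - f(theta - h + eps)) / (2.0 * h))

    return score, reparam, coupled

if __name__ == "__main__":
    theta = 1.5
    score, reparam, coupled = gradient_estimates(theta)
    print(f"exact={2 * theta:.3f} score={score:.3f} "
          f"reparam={reparam:.3f} coupled={coupled:.3f}")
```

In this toy setting all three estimators target the same derivative, but the score function estimator has noticeably higher variance. For a quadratic integrand the coupled difference coincides with the reparameterized estimator up to rounding, which would not hold for a general integrand.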
