Computing Transition Pathways for the Study of Rare Events Using Deep Reinforcement Learning (2404.05905v1)
Abstract: Understanding the transition events between metastable states in complex systems is an important subject in the fields of computational physics, chemistry and biology. The transition pathway plays an important role in characterizing the mechanism underlying the transition, for example, in the study of conformational changes of bio-molecules. In fact, computing the transition pathway is a challenging task for complex and high-dimensional systems. In this work, we formulate the path-finding task as a cost minimization problem over a particular path space. The cost function is adapted from the Freidlin-Wentzell action functional so that it is able to deal with rough potential landscapes. The path-finding problem is then solved using a actor-critic method based on the deep deterministic policy gradient algorithm (DDPG). The method incorporates the potential force of the system in the policy for generating episodes and combines physical properties of the system with the learning process for molecular systems. The exploitation and exploration nature of reinforcement learning enables the method to efficiently sample the transition events and compute the globally optimal transition pathway. We illustrate the effectiveness of the proposed method using three benchmark systems including an extended Mueller system and the Lennard-Jones system of seven particles.
- Transition path sampling: Throwing ropes over rough mountain passes, in the dark. Annu. Rev. Phys. Chem., 53(1):291–318, 2002.
- Efficient transition path sampling: Application to lennard-jones cluster rearrangements. J. Chem. Phys., 108(22):9236–9245, 1998.
- Transition path sampling. Adv. Chem. Phys., 123:1–78, 2002.
- The graph limit of the minimizer of the onsager-machlup functional and its computation. Science China Mathematics, 64:239–280, 2021.
- Transition-path theory and path-finding algorithms for the study of rare events. Annu. Rev. Phys. Chem., 61:391–420, 2010.
- String method for the study of rare events. Phys. Rev. B, 66(5):052301, 2002.
- Minimum action method for the study of rare events. Commun. Pure Appl. Math., 57(5):637–656, 2004.
- Finite temperature string method for the study of rare events. J. Phys. Chem. B, 109(14):6688–6693, 2005.
- Simplified and improved string method for computing the minimum energy paths in barrier-crossing events. J. Chem. Phys., 126(16), 2007.
- Computing committors in collective variables via mahalanobis diffusion maps. Applied and Computational Harmonic Analysis, 64:62–101, 2023.
- Conjugate peak refinement: an algorithm for finding reaction paths and accurate transition states in systems with many degrees of freedom. Chem. Phys. Lett., 194(3):252–261, 1992.
- Fleming, W. H. Exit probabilities and optimal stochastic control. Applied Mathematics and Optimization, 4:329–346, 1977.
- Random Perturbations of Dynamical Systems. Springer Press, Berlin, Heidelberg, 2012.
- Addressing function approximation error in actor-critic methods. In Proceedings of the 35th International Conference on Machine Learning, volume 80, pp. 1587–1596. PMLR, 2018.
- Onsager–machlup action-based path sampling and its combination with replica exchange for diffusive and multiple pathways. J. Chem. Phys., 132(13), 2010.
- Numerical computation of rare events via large deviation theory. Chaos: An Interdisciplinary Journal of Nonlinear Science, 29(6), 2019.
- Deep reinforcement learning in finite-horizon to explore the most probable transition pathway. Physica D: Nonlinear Phenomena, 458:133955, 2024.
- Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In Proceedings of the 35th International Conference on Machine Learning, volume 80, pp. 1861–1870. PMLR, 2018.
- The geometric minimum action method: A least action principle on the space of curves. Commun. Pure Appl. Math., 61(8):1052–1117, 2008.
- Nudged elastic band method for finding minimum energy paths of transitions. In Classical and quantum dynamics in condensed phase simulations, pp. 385–404. World Scientific, 1998.
- Solving for high-dimensional committor functions using artificial neural networks. Research in the Mathematical Sciences, 6:1–13, 2019.
- Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations, 2015.
- Computing committor functions for the study of rare events using deep learning. J. Chem. Phys., 151(5), 2019.
- Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971, 2015.
- String method in collective variables: Minimum free energy paths and isocommittor surfacesg. J. Chem. Phys., 125(2), 2006.
- Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013.
- Human-level control through deep reinforcement learning. Nature, 518(7540):529–533, 2015.
- Calculation of classical trajectories with a very large time step: Formalism and numerical examples. J. Chem. Phys., 105(20):9299–9315, 1996.
- Fluctuations and irreversible processes. Physical Review, 91(6):1505, 1953.
- Transition pathways in complex systems: Application of the finite-temperature string method to the alanine dipeptide. J. Chem. Phys., 123(13):6688–6693, 2005.
- Active importance sampling for variational objectives dominated by rare events: Consequences for optimization and generalization. In Mathematical and Scientific Machine Learning, pp. 757–780, 2022.
- Deterministic policy gradient algorithms. In Proceedings of the 31st International Conference on Machine Learning, volume 32, pp. 387–395, 2014.
- Voter, A. F. Hyperdynamics: Accelerated molecular dynamics of infrequent events. Phys. Rev. Lett., 78(20):3908, 1997.
- Kinetic paths, time scale, and underlying landscapes: A path integral framework to study global natures of nonequilibrium systems and networks. J. Chem. Phys., 133(12), 2010.
- Adaptive minimum action method for the study of rare events. J. Chem. Phys., 128(10), 2008.
- Bo Lin (68 papers)
- Yangzheng Zhong (1 paper)
- Weiqing Ren (14 papers)