Mixed $\mathcal{H}_2/\mathcal{H}_\infty$-Policy Learning Synthesis (2302.08846v2)
Abstract: A robustly stabilizing optimal control policy in a model-free mixed $\mathcal{H}2/\mathcal{H}\infty$-control setting is here put forward for counterbalancing the slow convergence and non-robustness of traditional high-variance policy optimization (and by extension policy gradient) algorithms. Leveraging It^{o}'s stochastic differential calculus, we iteratively solve the system's continuous-time closed-loop generalized algebraic Riccati equation whilst updating its admissible controllers in a two-player, zero-sum differential game setting. Our new results are illustrated by learning-enabled control systems which gather previously disseminated results in this field in one holistic data-driven presentation with greater simplification, improvement, and clarity.
Collections
Sign up for free to add this paper to one or more collections.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.