Separation principle

In control theory, a separation principle, more formally known as a principle of separation of estimation and control, states that under some assumptions the problem of designing an optimal feedback controller for a stochastic system can be solved by designing an optimal observer for the state of the system, which feeds into an optimal deterministic controller for the system. Thus the problem can be broken into two separate parts, which facilitates the design.

The first instance of such a principle is in the setting of deterministic linear systems, namely that if a stable observer and a stable state feedback are designed for a linear time-invariant system, then the combined observer and feedback is stable. The separation principle does not hold in general for non-linear systems in general. Another instance of the separation principle arises in the setting of linear stochastic systems, namely that state estimation (possibly nonlinear) together with an optimal state feedback controller designed to minimize a quadratic cost, is optimal for the stochastic control problem with output measurements. When process and observation noise are Gaussian, the optimal solution separates into a Kalman filter and a linear-quadratic regulator. This is known as linear-quadratic-Gaussian control. More generally, under suitable conditions and when the noise is a martingale (with possible jumps), again a separation principle applies and is known as the separation principle in stochastic control[1] [2] [3] [4] [5] [6]. A separation principle also exists for the control of quantum systems.

Proof of separation principle for deterministic LTI systems

Consider a deterministic LTI system:

where

represents the input signal,
represents the output signal, and
represents the internal state of the system.

We can design an observer of the form

and state feedback

Define the error e:

Then

Now we can write the closed-loop dynamics as

Since this is triangular, the eigenvalues are just those of A  BK together with those of A  LC.[7] Thus the stability of the observer and feedback are independent.

gollark: All my passwords are stored in some bit of the PotatOS code if I forget them. But I forgot which bit.
gollark: Or just use C L O U D since you're apparently okay with thatn
gollark: <@151391317740486657> Why not S E L F H O S T a password manager?
gollark: https://discordapp.com/channels/198130613759246337/530071845181849630/541628633819906078
gollark: Didn't you complain about it then ask for one you could send to someone else?

References

  1. Karl Johan Astrom (1970). Introduction to Stochastic Control Theory. 58. Academic Press. ISBN 0-486-44531-3..
  2. Tyrone Duncan and Pravin Varaiya (1971). "On the solutions of a stochastic control system". SIAM J. Control. 9 (3): 354–371..
  3. M.H.A. Davis and P. Varaiya (1972). "Information states for stochastic systems". J. Math. Anal. Applications. 37: 384–402..
  4. Anders Lindquist (1973). "On Feedback Control of Linear Stochastic Systems". SIAM Journal on Control. 11: 323–343..
  5. A. Bensoussan (1992). Stochastic Control of Partially Observable Systems. Cambridge University Press..
  6. Tryphon T. Georgiou and Anders Lindquist (2013). "The Separation Principle in Stochastic Control, Redux". IEEE Transactions on Automatic Control. 58 (10): 2481–2494. arXiv:1103.3005. doi:10.1109/TAC.2013.2259207..
  7. Proof can be found in this math.stackexchange .
  • Brezinski, Claude. Computational Aspects of Linear Control (Numerical Methods and Algorithms). Springer, 2002.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.