Generic ODE without exogenous inputs

46.1.1. Generic ODE without exogenous inputs#

Let a general general ODE, with \(\mathbf{M}\) non singular (otherwise DAE…any issue with DAE?)

\[\mathbf{M} \dot{\mathbf{x}} = \mathbf{f}(\mathbf{x}, \mathbf{u}) \ ,\]

with initial condition \(\mathbf{x}(0) = \mathbf{x}_0\).

46.1.1.1. Variational approach#

The objective function combines (weights) the error on a desired performance and the control input, in order to get the desired behavior with feasible control (that can be provided by actuators, without saturation, avoiding unnecessary high power input and too sharp behavior,…)

As an example, if the goal of the control \(\mathbf{u}\) is to keep the system around \(\mathbf{x} = \mathbf{0}\), the cost function to be minimized can be designed as

\[\begin{split}J = \int_{t=0}^{T} \dfrac{1}{2} \begin{bmatrix} \mathbf{x}^T & \mathbf{u}^T \end{bmatrix} \begin{bmatrix} \mathbf{Q} & \mathbf{S} \\ \mathbf{S}^T & \mathbf{R} \end{bmatrix} \begin{bmatrix} \mathbf{x} \\ \mathbf{u} \end{bmatrix} \, dt + \frac{1}{2} \mathbf{x}^T(T) \mathbf{Q}_T \mathbf{x}(T) \ ,\end{split}\]

with \(\mathbf{Q} \ge 0\), and \(\mathbf{R} > 0\) and symmetric.

Constrained optimization

\[\widetilde{J}(\mathbf{x},\mathbf{u}; \boldsymbol\lambda) = \int_{\tau=0}^{T} C\left( \mathbf{x}(\tau), \mathbf{u}(\tau) \right) \, d\tau + D\left( \mathbf{x}(T) \right) - \int_{\tau=0}^{T} \boldsymbol\lambda^T \left( \mathbf{M} \dot{\mathbf{x}} - \mathbf{f}(\mathbf{x}(\tau), \mathbf{u}(\tau)) \right) \, d \tau \ .\]

\[\begin{split}\begin{aligned} 0 = \delta \widetilde{J} & = \int_{t=0}^{T} \delta \mathbf{x}^T \left( \partial_\mathbf{x} C + \partial_{\mathbf{x}} \mathbf{f}^T \boldsymbol\lambda + \mathbf{M}^T \dot{\boldsymbol{\lambda}}\right) d \tau + \\ & + \int_{t=0}^{T} \delta \mathbf{u}^T \left( \partial_\mathbf{u} C + \partial_{\mathbf{u}} \mathbf{f}^T \boldsymbol\lambda \right) d \tau + \\ & + \delta \mathbf{x}^T(T) \partial_{\mathbf{x}_T} D + \\ & + \int_{t=0}^{T} \delta \boldsymbol\lambda^T ( \mathbf{M} \dot{\mathbf{x}} - \mathbf{f}(\mathbf{x}, \mathbf{u}) ) d \tau + \\ & - \left. \delta \mathbf{x} \mathbf{M}^T \boldsymbol\lambda \right|_{t=0}^{T} \ . \end{aligned}\end{split}\]

So that

(46.1)#\[\begin{split}\begin{aligned} \delta \boldsymbol\lambda(t): & \quad \mathbf{M} \dot{\mathbf{x}} = \mathbf{f} \\ & \quad \mathbf{x}(0) = \mathbf{x}_0 \\ \delta \mathbf{x}(t) : & \quad \mathbf{M}^T \dot{\boldsymbol{\lambda}} = -\partial_{\mathbf{x}} \mathbf{f}^T \boldsymbol\lambda - \partial_{\mathbf{x}} C \\ \delta \mathbf{x}(T) : & \quad \mathbf{M}^T \boldsymbol\lambda(T) = \partial_{\mathbf{x}_T} D \\ \delta \mathbf{u}(t) : & \quad \mathbf{0} = \partial_\mathbf{u} C + \partial_\mathbf{u} \mathbf{f}^T \boldsymbol\lambda \ . \end{aligned}\end{split}\]

The equations can be recast after the definition of the Hamiltonian of the system \(H(\mathbf{x},\mathbf{u},\boldsymbol\lambda) = C(\mathbf{x},\mathbf{u}) + \boldsymbol\lambda^T \mathbf{f}(\mathbf{x},\mathbf{u})\) as

(46.2)#\[\begin{split}\begin{aligned} \delta \boldsymbol\lambda(t): & \quad \mathbf{0} = - \mathbf{M} \dot{\mathbf{x}} + \partial_{\boldsymbol\lambda} H = - \mathbf{M} \dot{\mathbf{x}} + \mathbf{f} \\ & \quad \mathbf{x}(0) = \mathbf{x}_0 \\ \delta \mathbf{x}(t) : & \quad \mathbf{M}^T \dot{\boldsymbol{\lambda}} = - \partial_{\mathbf{x}} H \\ \delta \mathbf{x}(T) : & \quad \mathbf{M}^T \boldsymbol\lambda(T) = \partial_{\mathbf{x}_T} D \\ \delta \mathbf{u}(t) : & \quad \mathbf{0} = \partial_\mathbf{u} H \ . \end{aligned}\end{split}\]

46.1.1.2. Dynamic programming approach#

Dynamic programming approach for optimal control may be interpreted as the continuous-time counterpart fo the dynamic programming approach used in reinforcement learning (RL) on Markov decision processes. While RL usually produces a model-free policy - control law - and thus exploits Bellman’s equations on the state-action value function \(Q(s;a)\), optimal control is a model-based control system design that relies on a model of the system and uses Bellman’s equation for the state value function \(V(s)\).

Generic ODE without exogenous inputs

Contents

46.1.1. Generic ODE without exogenous inputs#

46.1.1.1. Variational approach#

46.1.1.2. Dynamic programming approach#