38. Kalman decomposition#
Controllability and Non-observability properties define two sub-spaces of the state space, \(\mathbb{R}^n\). The complements of these subpsaces define subspaces as well.
Kalman decomposition applies a coordinate transformation in order to make explicit the existence of:
\(X_1\), non-observable and reachable sub-space, \(X_1 := X_{\overline{o}} \cap X_r\)
\(X_2\), the complement of \(X_1\) w.r.t. the reachable sub-space, i.e. \(X_1 \cup X_2 = X_r\)
\(X_3\), the complement of \(X_1\) w.r.t. the non-observable sub-space, i.e. \(X_1 \cup X_3 = X_{\overline{o}}\)
\(X_4\), the complement of \(X_1\), \(X_2\), \(X_3\) w.r.t. the state-space \(\mathbb{R}^n\).
\(A\)-invariance of \(\text{Ran}(\mathbf{W}_c)\) and \(\text{Ker}(\mathbf{W}_o)\) is exploited.
In this section, singular value decomposition of the Gramians of observability and controllability helps in the decomposition of a LTI system in its non-observable/observable and controllable/non-controllable parts.
As shown in the section about coordinate transformation, in the Kalman decomposition subsection, only the observable and controllable parts appear in the input-output transfer function of the system.
todo SVD may help or not, but that’s not the point here. We’re supposed to know how to find an orthonormal basis, and if we can’t remember how to do it, we’re supposed to find a the method we like the most to do that. Orthonormal basis are not even strictly required here. Moreover, SVD of symmetric (semi) definite positive matrices coincides with spectral decomposition…
todo Infinite-horizon time, or steady-state conditions, are discussed here?
38.1. Controllability#
Controllability
Let \(\mathbf{W}_c\) the controllability Gramian of a LTI system. Controllable states belong to the range of \(W_c\). Controllable states define a sub-space of \(\mathbb{R}^n\), being \(\mathbf{x} \in \mathbb{R}^n\). Singular value decomposition (or spectral decomposition) of the semi-poisitive definite symmetric Gramian matrix,
provides a decomposition of \(\mathbb{R}^n\) into the controllable sub-space and its orthogonal complement, with
with the columns of \(\mathbf{U}_c\) as a base of the controllable sub-space. Introducing a change of variable of the state
the LTI system becomes
and thus
Properties
The columns of the matrix \(\mathscr{C} = [ \, \mathbf{B} \, | \, \mathbf{A} \mathbf{B} \, | \, \dots \, | \, \mathbf{A}^{n-1} \mathbf{B} \, ]\) are linear combinations of the range of \(\mathbf{W}_C\), i.e. of the columns of \(\mathbf{U}_c\), and thus they’re orthogonal w.r.t. the columns of \(\mathbf{U}_{\overline{c}}\). It follows that
as \(\mathbf{U}^*_{\overline{c}} \mathbf{A}^k \mathbf{B} = \mathbf{U}_{\overline{c}}^* \mathbf{U}_c \boldsymbol\alpha\) and \(\mathbf{U}_{\overline{c}}^* \mathbf{U}_c = \mathbf{0}\)
38.2. Observability#
Observability
Let \(\mathbf{W}_o\) the observability Gramian of a LTI system. Non-observable states belong to the kernel of \(W_o\). Non-observable states define a sub-space of \(\mathbb{R}^n\), being \(\mathbf{x} \in \mathbb{R}^n\). Singular value decomposition (or spectral decomposition) of the semi-poisitive definite symmetric Gramian matrix,
provides a decomposition of \(\mathbb{R}^n\) into the non-observable sub-space and its orthogonal complement, with
with the columns of \(\mathbf{U}_{\overline{o}}\) as a base of the non-observable sub-space. Introducing a change of variable of the state
the LTI system becomes
and thus
as \(\mathbf{U}_o^* \mathbf{A} \mathbf{U}_{\overline{o}} = \mathbf{0}\) (see below).
Properties
The kernel of the observability Gramian is the sub-space of vectors \(\mathbf{v}\) so that \(\mathscr{O} \mathbf{v} = \mathbf{0}\), with
It follows that
so that the rows of \(\mathbf{C}\) and the rows of \(\mathbf{C} \mathbf{A}^k\) are linear combinations of the columns of \(\mathbf{U}_o\), i.e. the columns of \(\left( \mathbf{C} \mathbf{A}^k \right)^*\) are linear combinations of the columns of \(\mathbf{U}_o\),
or
As the property holds for every \(k \in \mathbb{N}\), the following holds as well
38.3. Kalman decomposition#
Let
\(\mathbf{U}_1\) spanning the reachable and non-observable sub-psace, \(R \cap \overline{O}\)
\(\mathbf{U}_2\) complementing \(\mathbf{U}_1\) to get the reachable sub-sspace \(R\)
\(\mathbf{U}_3\) complementing \(\mathbf{U}_3\) to get the non-observable sub-space \(O\)
\(\mathbf{U}_4\) complementing \(\mathbf{U}_1\), \(\mathbf{U}_2\), \(\mathbf{U}_3\) to get \(\mathbb{R}^n\)
The columns of \(\mathbf{U}_1\) and \(\mathbf{U}_3\) form a vector basis of the non-observable sub-space. The columns of \(\mathbf{U}_1\) and \(\mathbf{U}_2\) form a vector basis of the reachable subspace.
A coordinate transfromation of the form
produce a transformed system with the following structure
Details
Columns of \(\ \mathbf{T}\). Here the columns of \(\mathbf{U}_1\) are assumed to be a set of unit normal vectors, \(\mathbf{U}_1^* \mathbf{U}_1 = \mathbf{I}\). Without this assumption, Gram-Schmidt orthogonalization process ensure a orthogonal basis \(\boldsymbol\Phi_1\) exists and the basis in \(\mathbf{U}_1\) can be written as a linear combination of the columns of \(\boldsymbol\Phi_1\), i.e. \(\mathbf{U}_1 = \boldsymbol\Phi_1 \boldsymbol\alpha_1\), with non-singular \(\boldsymbol\alpha_1\). Columns of \(\mathbf{U}_2\) are linearly independent from the columns of \(\mathbf{U}_1\) to form a basis of the reachable sub-space. Columns \(\mathbf{U}_3\) are linearly independent from the columns of \(\mathbf{U}_1\) to form a basis of the non-observable sub-space. In general, they’re not unit orthogonal but a unit orthogonal basis can be found, to get
In general, the columns of \(\boldsymbol\Phi_2\) and \(\boldsymbol\Phi_3\) are not mutually orthogonal, as the reachable sub-space and the non-observable sub-space are not orthogonal, \(\boldsymbol\Phi_2^* \boldsymbol\Phi_3 \ne \mathbf{0}\). The columns of the complement \(\mathbf{U}_4\) are assumed to be unit normal vectors, orthogonal w.r.t. the columns of \(\mathbf{U}_1\), \(\mathbf{U}_2\), \(\mathbf{U}_3\) (and of \(\boldsymbol\Phi_2\), \(\boldsymbol\Phi_3\)).
Recast \(\ \mathbf{T}\).
Inverse \(\ \mathbf{T}^{-1}\).
with
Details
Transformed matrices: matrix \(\hat{\mathbf{C}}\).
as \(\mathbf{C} \mathbf{U}_1 = \mathbf{0}\), \(\mathbf{C} \boldsymbol\Phi_3 = \mathbf{0}\). Looking at the structures of the matrices \(\mathbf{C} \boldsymbol\Phi\) and \(\boldsymbol\alpha\),
it’s easy to show that this multiplication preserves the structure of the matrix \(\mathbf{C}\boldsymbol\Phi\). Explicitly evaluating the non-zero blocks, the transformed matrix reads
Transformed matrices: matrix \(\hat{\mathbf{B}}\).
as \(\mathbf{U}_1^* \mathbf{B} = \mathbf{0}\), \(\left[ \left( \mathbf{I} - \boldsymbol\Phi_2 \boldsymbol\Phi_2^* \right) \boldsymbol\Phi_3 \right]^* \mathbf{B} = \mathbf{0}\). The last relation holds, as \(\left( \mathbf{I} - \boldsymbol\Phi_2 \boldsymbol\Phi_2^* \right) \boldsymbol\Phi_3\) is the orthogonal projection of \(\boldsymbol\Phi_3\) perpendicular to the vectors \(\boldsymbol\Phi_2\). Since this projection is perpendicular both to \(\mathbf{U}_1\) and to \(\boldsymbol\Phi_2\) (that generates the reachable sub-space), it follows that \(\left[ \left( \mathbf{I} - \boldsymbol\Phi_2 \boldsymbol\Phi_2^* \right) \boldsymbol\Phi_3 \right]^* \mathbf{B} = \mathbf{0}\). The same argument gives \(\mathbf{U}_4^* \mathbf{B} = \mathbf{0}\), and thus
Looking at the structure of the matrices \(\boldsymbol\alpha^{-1}\) and \(\boldsymbol\Phi^{-1} \mathbf{B}\), it’s easy to show that their matrix multiplication preserves the structure
Transformed matrices: matrix \(\hat{\mathbf{A}}\).
The structure of the matrix arises from \(\mathbf{A}\)-invariance of reachability and controllability sub-spaces and from the properties
and recalling that \(\mathbf{U}_1: \, r \overline{o}\), \((\mathbf{I}-\boldsymbol\Phi_3 \boldsymbol\Phi_3)^* \boldsymbol\Phi_2: r o \, \), \((\mathbf{I}-\boldsymbol\Phi_2 \boldsymbol\Phi_2)^* \boldsymbol\Phi_3: \, \overline{r} \overline{o}\), \(\mathbf{U}_4: \, \overline{r} o\). Focusing on \(\boldsymbol\Phi^{-1} \mathbf{A} \boldsymbol\Phi\) first
Pre- and post-multiplication by \(\boldsymbol\alpha^{-1}\) and \(\boldsymbol\alpha\) preserves the structure of \(\boldsymbol\Phi^{-1} \mathbf{A} \boldsymbol\Phi\) to have