Model Validation by Geometric Arguments

(1)

Model Validation by Geometric Arguments

T. McKelvey

Department of Electrical Engineering Linkoping University

S-581 83 Linkoping, Sweden

tomas@isy.liu.se

To be presented at ECC95

Report LiTH-ISY-R-1772, ISSN 1400-3902 Keywords: System identication, model validation, sin-

gular value decomposition, state-space models

Abstract

In this contribution we introduce a model quality measure called Modal Coherence Indicators which gives a com- plement to traditional model validation techniques. The indicators are derived by a geometric comparison of the model induced observability and controllability matrices with the data induced ones using a common description basis.

1 Introduction

Linear dynamical models play a key role in many indus- trial applications such as vibration analysis, design of controllers, signal processing etc. This paper deals with the question of assessing the quality of a given linear dy- namical model in state-space form by comparing its prop- erties with measured real world quantities. We will focus our presentation on the case when parts of the impulse response is the quantity which is measured or derived by some non-parametric method, e.g. correlation analysis.

In a linear model the location of the poles play an im- portant role of the model. The poles are usually associ- ated with the concept of modes were we distinguish be- tween oscillative modes corresponding to a pair of com- plex conjugate poles and sub-critically damped modes which correspond to real valued poles. In vibration anal- ysis of mechanical systems the aim is to identify the pole locations of the oscillative poles from which natural fre- quency and damping ratio can be determined. In a model validation step it is then important to verify that the identied pole locations in the model indeed are part of the measured data and not only a pole introduced by noisy measurements, data imperfections or numerical er-

ThisworkwassupportedbytheSwedishResearchCouncilfor

EngineeringSciences(TFR),whichisgratefullyacknowledged.

rors. The aim of this paper is to describe such a modal quality measure.

We will consider the linear model to be represented in a state-space form

x ( t + 1) = Ax ( t ) + Bu ( t )

y ( t ) = Cx ( t ) ⁽¹⁾ with input u ( t )

²^R

^m , output y ( t )

²^R

^p and state x ( t )

²

R

n .

The most straight-forward way to determine the va- lidity of a state-space model with respect to a measured empirical impulse response is simply to calculate the im- pulse response of the state-space model and compare it with the empirical one, e.g. calculate the norm of the errors. This type of measure only describes the total per- formance of the model and gives no information about which modes of the model are accurate and which are not. The modes are easily accessible from the eigenval- ues of the A matrix of a model (1) but are fairly hidden in the empirical impulse response unless a model is esti- mated from the impulse response. Hence, it is somewhat dicult to obtain a distributed measure which describes the quality of each of the modes of the model by com- paring the measured impulse response and the system ( ABC ) to each other.

The Modal Amplitude Coherence, introduced by Juang and Pappa 4] is one approach to obtain quality measures connected to each of the estimated modes of the system.

This measure uses the controllability matrix of the es- timated state-space model and is only dened for state- space models derived from the realization algorithms by Kung 5], Zeiger-McEwen 10] or Juang and Pappa 4].

This measure will now be generalized for arbitrary esti- mation methods and we present a practical method of how to calculate it.

Assume an impulse response

^f

h t

^g

Nt

⁼¹

and a state-space realization ( ABC ) is given. The Modal Coherence is a set of indicators k k = 1 ::: n . Each indicator k

is associated with one eigenvalue k of the state tran-

sition matrix A . The set of indicators k is derived by

1

(2)

comparing the extended observability and controllability matrices from the model ( ABC ) with the extended ob- servability and controllability matrices directly derived from the empirical impulse response without forming an explicit model.

A fundamental problem in the comparison is that the controllability and observability matrices must be con- verted to a unique common basis in order to make any meaningful comparison. In what follows we will intro- duce a state-space realization in a normal form as well as a normal form of a singular value decomposition of a Hankel matrix. These normal forms will form a common unique description basis.

2 Preliminaries

Let the extended observability and controllability matri- ces

^O

i and

^C

j induced by the model (1) be dened as

O

i =

2

6

4

CA C CA ... ⁱ

^;1

3

7

5

(2)

C

j =

B AB

A ^j

^;1

B

(3) with the dimensions

^O

i

² ^R

pi

n and

^C

j

²^R

n

mj . We will also assume minimality of ( ABC ) which implies that, if min( ij )

n ,

^O

i and

^C

j both have rank n .

Denition 2.1 A state-space model ( ABC ) is ij - balanced if the i -extended observability matrix (2) and j -extended controllability matrix (3) satisfy the relations

O

Ti

^O

i =

^C

j

^C

Tj = (4) where = diag(

¹

²

::: n ) with

¹

²

:::

n

0.

²

We note the following property of .

Lemma 2.1 If the system ( ABC ) is minimal then

_n > 0 in (4) if min( ij )

n .

Proof. The minimality and the choice of i and j directly implies that

^O

i and

^C

j both are of rank n . The construc- tion of then directly shows that rank() = n

⁾

n > 0

2

For a minimal system the rank of

^O

_Ti

^O

j and

^C

j

^C

Tj is always n , independently of the basis chosen for the state variables.

Let the measured impulse response

^f

h t

^g

be formed into a Hankel matrix of size pi

mj

H ^ij =

2

6

4

h

¹

h

²

h j

h

²

h

³

h j

⁺¹

... ...

...

h i h i

⁺¹

h i

⁺

j

^;1

3

7

5

(5)

with the singular value decomposition H ^ij =

U

¹

U

²

₀

¹

⁰

2

V

¹

^T V

²

^T

(6) where

¹

= diag(

¹

²

::: _n ) and

¹

²

:::

_n are the n largest singular values.

The state-space model (1) has the impulse response h t = CA ^t

^;1

B t > 0 (7) which gives

H ^ij =

^O

i

^C

j (8) and consequently

²

= 0 in (6) since

^O

i and

^C

j both have rank n .

2.1 A Normal Form for the SVD of the Hankel Matrix

Denition 2.2 In the normal form singular value de- composition of a matrix

H ^ij = U V ^T (9) the three factors U , and V are unique.

²

Consider the SVD (6) of the Hankel matrix H ^ij con- structed from the given impulse response. We can then consider

O

di = U

¹

¹¹

⁼

²

(10) and

C

dj =

¹¹

⁼

²

V

¹

^T (11) respectively, as the controllability and observability ma- trix of a system of order n directly inferred from the given impulse response. Notice that

O

diT

^O

di =

^C

_dj

^C

_djT =

¹

(12) which we, from Denition 2.1, recall as ij -balancing.

Hence we can say that the data-inferred observability and controllability matrices are given in ij -balanced form.

The factorization of the Hankel matrix H ^ij obtained by the SVD (6) is not uniquely dened. By assuming that H ^ij is constructed from the impulse response of an n th order system, we can write

H ^ij = U

¹

V

¹

^T (13) with U

¹² ^R

^pi

ⁿ , U

¹

^T U

¹

= I , V

¹

^T

²^R

ⁿ

^mj , V

¹

V

¹

^T = I ,

¹

= diag(

¹

::: _n ) and

¹

:::

_n . This factor- ization is unique up to a matrix T

²^R

ⁿ

ⁿ , satisfying

T

¹

=

¹

T TT ^T = I: (14)

(3)

If (14) holds, a valid SVD factorization is then also H ^ij = U

¹

V

¹

^T (15) with U

¹

= U

¹

T and V

¹

= V

¹

T . If we limit ourself to consider the generic case of distinct singular values in

¹

, it is easy to show that the matrices satisfying (14) are of the form

T s = diag( s

¹

s

²

::: s n ) (16) where

^f

s _i

^g

are either +1 or

^;

1. We now introduce a normal form for the SVD for the case of distinct singu- lar values by require a positive sign of the rst non-zero element in each column of U

¹

. Let U

¹

^k denote the k th col- umn of U

¹

. A normal form is constructed by determining the signs in the diagonal of T s = diag ( s

¹

::: s n ) in the following way.

8k2

1

^:^:^:ⁿ

]: If the rst non-zero element in

^U¹^k

is negative, take

^s^k

=

^;

1, otherwise

^s^k

= 1. (17) Since U

¹

is always of full rank, at least one element in each column U

¹

^k is non-zero and each sign s k will be unique. A unique SVD factorization of H ^ij will then be H ^ij = U

¹

V

¹

^T (18) where U

¹

= U

¹

T s and V

¹

= V

¹

T s . We summarize the result in a lemma.

Lemma 2.2 Assume that the singular value decompo- sition of H ^ij is given by (13) with distinct singular values

¹

. A normal form SVD (9) is then dened by the steps

(17){(18).

²

2.2 The

^ij

-Balanced Normal Form

We will now continue by deriving a realization in nor- mal form for the state-space system ( ABC ) which will match the SVD factorization above. The realization will be based on the ij -balanced realization from De- nition 2.1.

Given a general minimal system ( ABC ) the follow- ing algorithm 6] will dene a similarity transformation matrix T such that ( T

^;1

ATT

^;1

BCT ) is in ij -balanced form. Begin by forming

Q =

^O

_Ti

^O

i (19) and

P =

^C

j

^C

Tj : (20) Use the Cholesky factorization 3] to obtain

Q = R ^T R (21)

with R

²^R

ⁿ

ⁿ and the SVD

RPR ^T = U

²

U ^T (22)

with U

²^R

ⁿ

ⁿ and U ^T U = I and the diagonal matrix

²^R

ⁿ

ⁿ with = diag(

¹

²

::: _n ) and

¹

²

:::

n . The similarity transformation matrix is then dened by

T = R

^;1

U

¹

⁼

²

(23)

Lemma 2.3 A minimal state-space system ( ABC ) will be transformed to ij -balanced form by the similarity transformation given by (19){(23) if min( ij )

n .

Proof. From the minimality and min( ij )

n , it follows from Lemma 2.1 that both Q and P are positive denite symmetric matrices and hence the calculations (19){(23) are well dened. The transformation directly gives

T ^T

^O

_Ti

^O

i T =

¹

⁼

²

U ^T R

^;

^T R ^T RR

^;1

U

¹

⁼

²

= (24) and

T

^;1^C

j

^C

Tj T

^;

^T =

^;1

⁼

²

U ^T RPR ^T U

^;1

⁼

²

= (25)

which concludes the proof.

²

The ij -balanced form is not a unique realization 9, 2], since state transformations T satisfying

T = T TT ^T = I (26) also results in a new ij -balanced form. A normal form is constructed if this ambiguity is removed.

Denition 2.3 ^{If (} ^A

¹

^B

¹

^C

¹

) is input-output equiv- alent to the realization ( A

²

B

²

C

²

), a normal form is a mapping

M :

^R

ⁿ

^R

ⁿ

^m

^R

^p

ⁿ

^!^R

ⁿ

^R

ⁿ

^m

^R

^p

ⁿ (27) such that

M ( A

¹

B

¹

C

¹

) = M ( A

²

B

²

C

²

) : (28)

2

If two models are given in the normal form, we can compare the system matrices element by element to de- termine system equivalence. A normal form can also be considered as a unique basis for the states of the system.

Consider a minimal state-space realization ( ABC ) which is ij -balanced. Assume all the singular values in

to be distinct and ordered in a descending order ac- cording to Denition 2.1. The basis of the states of the ij -balanced form is then unique up to similarity transfor- mations T s (16). By uniquely determining the n signs of the diagonal element of T s a normal form will emerge as ( T s AT s T s BCT s ). This can be accomplished as follows.

Construct the extended observability matrix

^O

i . Each

(4)

column of

^O

i have at least one non-zero element since the system is observable. The diagonal elements of T _s are uniquely given by

8k2

1

^:^:^:ⁿ

]: If the rst non-zero element in

^Oⁱ^k

is negative, take

^sk

=

^;

1, otherwise

^sk

= 1. (29) In (29)

^O

_ki is the k th column of

^O

i .

Lemma 2.4 The ij -balancing algorithm (19){(23) re- sulting in the minimal state-space description ( ABC ) and the sign determination of the transformation matrix T s (29), result in a normal form ( T s AT s T s BCT s ), pro- vided that the diagonal elements of in (4) are distinct.

Proof. Given the ij -balanced realization, with hav- ing distinct singular values, it is easy to show that the only state transformation matrices which preserves the ij -balanced form is of the form T s (16). The minimal- ity implies that

^O

i have full rank and thus, each column of

^O

i have a non-zero element. The introduction of the positivity condition of the rst non-zero element of each column of the extended observability matrix

^O

i , makes

the realization unique.

²

The case of non-distinct diagonal elements in can also be handled if additional steps are included, see the proof of Theorem 2.1 in 7]. The generic case, however, is distinct singular values and this will suce for our pur- poses.

Theorem 2.1 Consider an ij -balanced normal form ( ABC ) with the extended observability and controllabil- ity matrices

^O

i and

^C

j , respectively, and a normal form SVD factorization (18) of a Hankel matrix H ^ij with dis- tinct singular values

¹

. Then

O

i = U

¹

¹¹

⁼

²

and

^C

j =

¹¹

⁼

²

V

¹

^T (30) if and only if H ^ij is constructed from the impulse response of the system ( ABC ).

Proof. ^Assume ^H ^ij is constructed from the system ( ABC ). It then follows that

H ^ij =

^O

i

^C

j : (31) From Denition 2.1 we have the relations

O

Ti

^O

i =

^C

j

^C

Tj = : (32) From the singular value decomposition

O

i = ~ U o ~

¹

⁼

²

V ~ _To (33) note that

=

^O

_Ti

^O

i = ~ V o ~~ V _To (34) and hence ~ = . was assumed to have distinct or- dered elements on the diagonal which then implies that

V ~ _o = ~ T _s for some sign matrix ~ T _s . From Lemma 2.4 it follows that the rst non-zero element in each column in

O

Ti is positive. If we impose the same condition on ~ U o

we conclude that ~ T s = I . By similar arguments we can write

C

j =

¹

⁼

²

V ~ _Tc : (35) This results in a factorization

H ^ij =

^O

i

^C

j = ~ U _o ~ V _Tc : (36) Since the rst non-zero element in each column of ~ U o is positive we conclude, according to Lemma 2.2, that this is a normal form SVD of the Hankel matrix. Hence

=

¹

U ^~ o = U

¹

and ~ V c = V

¹

(37) which proves the if part. The only if part is trivial since (30) implies

H ^ij =

^O

i

^C

j (38)

which concludes the proof.

²

The above result indicates a way of comparing implicit models given in impulse response form with models in state-space form. We will now consider such a quality measure which can be calculated for each eigenvalue of A and hence can be associated with the modes of the system.

3 Modal Coherence Indicators

Based on the two derived normal forms we are now ready to generalize the Modal Amplitude Coherence introduced in 4]. The method compares the modes from an esti- mated state-space model with the information contained in an impulse response. We will do this by using the normal forms and diagonalize the system to obtain a de- coupling of the dierent modes.

Consider a state-space dynamic system ( ABC ) such that A can be diagonalized,

9

non-singular T

²^C

ⁿ

ⁿ T

^;1

AT = (39) where = diag (

¹

²

::: n ) and i

² ^C

are the eigenvalues of A . Let us use this transformation to obtain a state-space realization with complex matrices

( B _c C _c ) = ( T

^;1

ATT

^;1

BCT ) (40) This realization is particularly interesting since the modes (or frequencies) of the systems can be associated with the diagonal elements of , which are the poles of the system.

Furthermore, row k of the B c matrix and column k of the C c matrix are associated with the k th pole of the system.

We can thus write the transfer function as the sum G ( z ) =

^X

ⁿ

k

⁼¹

z

^;

1 k C _kc B _kc (41)

(5)

where the superscript denotes column and row numbers, respectively.

Recall that the model ( ABC ) given in ij -balanced normal form has the observability and controllability ma- trices

^O

i and

^C

j , respectively. If T is the complex similar- ity transformation which diagonalizes A , the correspond- ing complex observability and controllability matrices are

O

ci =

^O

i T

^C

_cj = T

^;1^C

j : (42) Note that the k th column of

^O

_ci , as well as the k th row of

^C

_cj , are associated with the k th eigenvalue k .

Consider the normal form SVD factorization (18) of the Hankel matrix H ^ij . If the Hankel matrix is constructed from the impulse response of ( ABC ), it is clear from Theorem 2.1 that

O

ci = U

¹

¹¹

⁼

²

T and

^C

_cj = T

^;1

¹¹

⁼

²

V

¹

^T (43) where T is given by (39). In practice when H ^ij is com- posed of the measured impulse response (43) does not exactly hold and we can measure the deviation between the model and the impulse response by calculating the cosine of the angle between the n columns and rows re- spectively. This leads to the following denition.

Denition 3.1 The modal observability coherence _ok is dened for k = 1 ::: n as

_ok =

^j

(

^O

_ci e k ) ^H U

¹

¹¹

⁼

²

Te k

^j

jO

ci e k

^jj

U

¹

¹¹

⁼

²

Te k

^j

: (44) The modal controllability coherence _ck is dened k = 1 ::: n as

_ck =

^j

e _Tk

^C

_cj ( e _Tk T

^;1

¹¹

⁼

²

V

¹

^T ) ^H

^j

j

e _Tk

^C

_cj

^jj

e _Tk T

^;1

¹¹

⁼

²

V

¹

^T

^j

(45) where

^O

_ci and

^C

_cj are given by (42) and U

¹

and V

¹

are given by (18), T by (39), e _k is a unit vector with a one in position k and (

) ^H denotes the conjugate transpose.

Furthermore, the Modal Coherence Indicator is dened for k = 1 ::: n as

k = _ck _ok (46)

2

Given a state-space model and an impulse response, the set

^f

k

^g

nk

⁼¹

of Modal Coherence Indicators describes the \coherence" between the state-space model and the impulse response and thus serves as a distributed model quality measure for the modes in the model ( ABC ).

The obvious properties of _k which follows from Theo- rem 2.1 are:

Corollary 3.1 The properties of the Modal Coherence Indicators from Denition 3.1 are

k

²

0 1]

⁸

k

²

1 2 ::: n ] (47)

and if the Hankel matrix H ^ij and the system ( ABC ) share the same impulse response coecients we have

k = 1

⁸

k

²

1 2 ::: n ] : (48) The modal controllability coherence from Deni- tion 3.1 is equal to the Modal Amplitude Coherence in- troduced in 4] if the state-space model ( ABC ) origi- nates from the algorithm by Zeiger-McEwen or the ERA- algorithm. The novelty of the introduced method is that given any state-space model and an impulse response, the Modal Coherence Indicators are well dened.

From the denition it follows that n Modal Coherence Indicators (MCIs) can be calculated from a system of order n . For a complex conjugate eigenvalue pair the two corresponding MCIs will be equal. For one vibrational mode we thus only need to consider one MCI. For the real eigenvalues each eigenvalue has a unique MCI.

The derived indicators have successfully been used for validation of vibrational models of an aircraft 1, 8] and a satellite separation system 8].

3.0.1 Stabilization Diagram

The Modal Coherence Indicators (46) can be used as a quality tag attached to each of the vibrational modes of an estimated model. Each vibrational mode is character- ized, among other things, by the frequency. The indica- tors k can be used in a diagram to visualize frequency location and modal accuracy. Such a diagram is called a stabilization diagram. In a stabilization diagram the horizontal axis represents frequency and the vertical axis Modal Coherence. A vertical bar of length k is plot- ted for each mode at the frequency location given by the mode. If several models of dierent orders are available the corresponding bar graphs can be stacked on top of each other, starting with the model with the lowest or- der. The characteristics of the diagram is that for identi-

ed models with a too low model order some frequen- cies (modes) give rise to low Modal Coherence Indicators (the bars are short). For higher order models these split into two or more modes with higher indicators (or disap- pear again). As the model order increases the identied frequencies \stabilize", hence the name stabilization dia- gram.

Example 3.1 An impulse response of a linear system of order n = 8 is given at 500 sample points. The system has 4 oscillative modes of varying frequencies. The impulse response is corrupted by additive zero mean Gaussian noise with variance 10

^;2

.

7 models with model orders 2 4 ::: 14 respectively are

estimated. In Figure 1 a stabilization diagram is shown

indicating the identied resonance frequencies and the

corresponding Modal Coherence Indicator (MCI). From

the diagram we see that the low frequency modes (be-

low 0.1 Hz) are accurately estimated by the lower order

(6)

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 2

4 6 8 10 12 14 16

Frequency (Hz)

Model Order n

Stabilization Diagram

Figure 1: Stabilization diagram indicating identied res- onance frequencies and corresponding Modal Coherence Indicator (MCI). For each model order a bar graph is shown. The location of the bars along the horizontal axis is determined by the frequencies of the modes from the model. The length of each bar is equal to the correspond- ing MCI k . In the gure the bar graphs for dierent model orders are placed on top of each other and the ver- tical axis show the model order. In the diagram models of orders 2 to 14 are shown.

models. The model of order 6 (3 resonant modes) have a MCI signicantly lower than 1 for the highest frequency.

At model order 8 (4 resonant modes) this indicator splits up into two with high MCI. At model order 8 we see that all four indicators are high. A further increase of the model order results in low indicators for the new reso- nance frequencies as well as for some of the old. Based on the diagram we would thus prefer a model order of 8 (4 resonant modes) where each mode is of high quality.

2

4 Conclusions

The Modal Coherence Indicators (MCIs) presented in this paper gives a complement to traditional model val- idation techniques based on a direct comparison of the time domain signals. The indicators are derived by com- paring the model induced observability and controllabil- ity matrices with the data induced ones using a common basis. Particularly when it is of importance to put a qual- ity tag on each mode (eigenvalue) of the model, the MCIs provide some explicit quality information.

References

1] T. Abrahamsson, T. McKelvey, and L. Ljung. A study of some approaches to vibration data analy-

sis. In Proc. of the 10th IFAC Symposium on Sys- tem Identication, volume 3, pages 289{294, Copen- hagen, Denmark, July 1994.

2] K. Glover. All optimal Hankel-norm approximations of linear multivariable systems and their L

¹

-error bounds. Int. J. Control, 39(6):1115{1193, 1984.

3] G. H. Golub and C. F. Van Loan. Matrix Compu- tations. The Johns Hopkins University Press, Balti- more, Maryland, second edition, 1989.

4] J. N. Juang and R. S. Pappa. An eigensystem real- ization algorithm for modal parameter identication and model reduction. J. of Guidance, Control and Dynamics, 8(5):620{627, 1985.

5] S. Y. Kung. A new identication and model reduc- tion algorithm via singular value decomposition. In Proc. of 12th Asilomar Conference on Circuits, Sys- tems and Computers, Pacic Grove, CA, pages 705{

714, 1978.

6] A. J. Laub. Computation of balancing transforma- tions. In Proc. JACC, San Francisco, pages paper FA8{E, 1980.

7] J. M. Maciejowski. Balanced realizations in system identication. In Proc. 7'th IFAC Symposium on Identication & parameter Estimation, York, UK, 1985.

Model Validation by Geometric Arguments