• No results found

Identication for Control: Some Results on Optimal Experiment Design

N/A
N/A
Protected

Academic year: 2021

Share "Identication for Control: Some Results on Optimal Experiment Design"

Copied!
15
0
0

Loading.... (view fulltext now)

Full text

(1)

Identication for Control: Some Results on Optimal Experiment Design

Urban Forssell and Lennart Ljung Department of Electrical Engineering Linkping University, S-581 83 Linkping, Sweden

WWW:

http://www.control.isy.l iu.s e

Email:

ufo@isy.liu.se, ljung@isy.liu.se

February 26, 1998

REGLERTEKNIK

AUTOMATIC CONTROL LINKÖPING

Report no.: LiTH-ISY-R-2014 Submitted to CDC '98

Technical reports from the Automatic Control group in Linkping are available by anonymous

ftp at the address

ftp.control.isy.liu.se

. This report is contained in the compressed

postscript le

2014.ps.Z

.

(2)

Identication for Control: Some Results on Optimal Experiment Design

Urban Forssell and Lennart Ljung Division of Automatic Control Department of Electrical Engineering Linkoping University, S-581 83 Linkoping, Sweden

E-mail: ufo@isy.liu.se, ljung@isy.liu.se URL: http://www.control.isy.liu.se/

February 26, 1998

Abstract

The problem of designing the identication experiments to make them maximally informative with respect to the intended use of the model is studied. A focus will be on model based control and we show how to choose the feedback regulator and the spectrum of the reference signal in case of closed-loop experiments. A main result is that when only the mist in the dynamics model is penalized and when both the input and the output power are constrained then the optimal controller is given by the solution to a standard LQ problem. When only the input power is constrained, it is shown that open- loop experiments are optimal. Some examples are also given to exemplify the theoretical results.

1 Introduction

The validation step in the identication procedure should be devoted to evaluating the model with respect to its intended use simulation, prediction, control, etc. To make fair judgments it is important to know what factors aect the model's performance and how these factors can be manipulated. In this paper we focus on experiment design issues and especially how one should choose the feedback regulator in closed-loop experiments. The aim will be to design the experiments so as to minimize the variance errors in the identied models. This route was also taken in 2] and 4], and the presentation here will be inspired by these papers. See also 7].

The main drawback with the optimal solutions that will be derived is that they typically will depend on unknown quantities, like the unknown system that we are trying to identify.

Clearly this will lead to infeasible designs. This problem is inevitable and shared with any optimal design problem. A way around this is to use iterations and to replace the true system in the optimal solutions with the current estimate of the system.

Despite this problem the optimal solutions derived here give important and useful insights

into questions like \Should open-loop or closed-loop experiments be used?" and \To which

frequency ranges should the input signal power be concentrated?".

(3)

The rest of the presentation will unveil as follows. Section 2 contains some background material on the identication method used and the assumptions made on the underlying system. The notation will also be introduced in this section. In Section 3 we provide a motivation for the type of criterion we will consider. A key issue is that the criterion will reect the variance error introduced when using identied models in the control design. Section 4 contains the main results and in Section 5 these results are applied to some well known control design schemes like internal model control and minimum variance control. Conclusions are given in Section 6.

2 Preliminaries

To evaluate a model's performance one needs some notion of the \best" or \true" model. We will assume that the true system is single-input single-output (SISO) and given by

y

(

t

) =

G0

(

q

)

u

(

t

) +

v

(

t

)

v

(

t

) =

H0

(

q

)

e

(

t

) (1)

Here

fe

(

t

)

g

is white noise with variance

0

. To simplify the notation we introduce the transfer matrix

T

0

(

q

) =

G0

(

q

)

H

0

(

q

)



(2) Given input-output data

ZN

=

fy

(1)

u

(1)

:::y

(

N

)

u

(

N

)

g

and a parameterized model structure

^

y

(

tj

) =

H;1

(

q

)

G

(

q

)

u

(

t

) + (1

;H;1

(

q

))

y

(

t

) (3) we can estimate the model by the straightforward t:

^



N

= argmin

 V

N

(



) (4)

V

N

(



) =

XN

t=1

"

2

(

t

) (5)

"

(

t

) =

y

(

t

)

;y

^ (

tj

) =

H;1

(

q

)(

y

(

t

)

;G

(

q

)

u

(

t

)) (6) For more details on this identication method see, e.g., 7]. The model structure (3) may also be represented as

y

(

t

) =

G

(

q

)

u

(

t

) +

H

(

q

)

e

(

t

) (7) In the sequel

G

(

q

) and

H

(

q

) will be called the dynamics model and the noise model, respectively. Let ^

GN

(

q

) =

G

(

q

^

N

) and ^

HN

(

q

) =

H

(

q

^

N

) and also introduce

^

T

N

(

q

) = ^

GN

(

q

)

^

H

N

(

q

)



(8) The error between the true transfer matrix

T0

and the estimated one ^

TN

will be denoted ~

TN

,

~

T

N

(

q

) =

T0

(

q

)

;T

^

N

(

q

) = ~

GN

(

q

)

~

H

N

(

q

)



(4)

with an obvious notation for ~

GN

and ~

HN

. As the number of data tends to innity and the model order

n

increases we have that (cf. 5], 9])

Cov ^

TN

(

ei!

)

 n

N

P

(

!

) =

n

N



v

(

!

) 

u

(

!

) 

ue

(

!

)



ue

(

;!

)

0



;T

(9)

In this paper we will assume that the input is generated as

u

(

t

) =

r

(

t

)

;K

(

q

)

y

(

t

) (10) and that the reference signal

r

is uncorrelated with the disturbance

e

, i.e., that the cross spectrum 

re

= 0 . We can then split the input spectrum into two parts, one that is due to

r

and one that is due to

e

:



u

(

!

) = 

ru

(

!

) + 

eu

(

!

) (11) By combining (1) and (10) we see that the input can be written

u

(

t

) =

S0

(

q

)

r

(

t

)

;K

(

q

)

S0

(

q

)

H0

(

q

)

e

(

t

) (12) Consequently



ru

(

!

) =

jS0

(

ei!

)

j2



r

(

!

) 

eu

(

!

) =

jK

(

ei!

)

j2jS0

(

ei!

)

j2jH0

(

ei!

)

j20

(13)



ue

(

!

) =

;K

(

ei!

)

S0

(

ei!

)

H0

(

ei!

)

0

(14) With these denitions we may rewrite

P

as

P

(

!

) = 

v

(

!

) 

u

(

!

) 

ue

(

!

)



ue

(

;!

)

0



;T

= 

v

(

!

)



0



u

(

!

)

;j



ue

(

!

)

j2 0 ;



ue

(

!

)

;



ue

(

;!

) 

u

(

!

)



T

=

jH0

(

ei!

)

j2



ru

(

!

)

0 ;



ue

(

!

)

;



ue

(

;!

) 

ru

(

!

) + 

eu

(

!

)



T

(15)

Here we have used the fact that



v

(

!

) =

0jH0

(

ei!

)

j2

(16) A remark regarding (15) is that this result holds not only for the direct method, considered here, but also for the so called indirect and joint input-output approaches to closed-loop identication. See, e.g., 3] and 1]. Therefore all the results below will also apply to these approaches as well.

For future use we also note that the output spectrum can be written



y

(

!

) =

jG0

(

ei!

)

j2



ru

(

!

) +

jS0

(

ei!

)

j2



v

(

!

) (17)

(5)

3 Measuring the Performance Degradation

Our focus here will be on model-based control. It is then natural to evaluate the model's performance by comparing the outputs of the closed-loop system resulting from control design using the \true" models and the estimated ones, respectively. In general this output will be a function of both

G

and

H

and some driving signal

w

:

y

(

t

) =

f

(

T

(

q

))

w

(

t

) (18) The \true" or \best" signal is

y

0

(

t

) =

f

(

T0

(

q

))

w

(

t

) (19) while the one resulting from the estimated models is

^

y

N

(

t

) =

f

( ^

TN

(

q

))

w

(

t

) (20) A measure of the performance degradation can be obtained by computing the mean square error

Ey

~

2N

(

t

) =

E

(

y0

(

t

)

;y

^

N

(

t

))

2

(21) Using rst-order approximations we may write

~

y

N

(

t

) = ~

TNT

(

q

)

F

(

q

)

w

(

t

) (22) where

F

(

q

) =

@

@T f

(

T

)



T=T0(q)

(23) From the denition of the spectrum it follows that

Ey

~

2N

(

t

) = 12

Z;



~yN

(

!

)

d!

= 12

Z;

tr !(

!

)

C

(

!

)]

d!

(24) where

!(

!

) =

ET

~

N

(

e;i!

) ~

TNT

(

ei!

) (25)

C

(

!

) =

F

(

ei!

)

w

(

!

)

FT

(

e;i!

) (26) Thus, if we want to minimize the performance degradation due to the use of identied models, a natural criterion to minimize would be

"

J

=

Z



;

tr !(

!

)

C

(

!

)]

d!

(27)

Let us comment on the matrix functions ! and

C

. First, if the bias errors are negligible then

! will be the (transpose of the) covariance matrix of the estimate ^

TN

,

!(

!

) =

h

Cov ~

TN

(

ei!

)

iT

(28)

(6)

The error made in this approximation should be small if the model structure is reasonably

exible. Using the standard result (9) we thus have

!(

!

)

 n

N P

T

(

!

) =

n

N



v

(

!

) 

u

(

!

) 

ue

(

!

)



ue

(

;!

)

0



;1

(29)

Second, the weighting function

C

should reect the relative importance of a good t in

G

and

H

, respectively, for dierent frequencies. We shall assume that

C

is Hermitian and positive semi-denite as in (30). In general we may write

C

(

!

) =

C11

(

!

)

C12

(

!

)

C

12

(

;!

)

C22

(

!

)



(30) If we neglect the scaling

n=N

(which will have the aect that the prejudice against high order models is reduced), the criterion (27) simplies to

J

=

Z



;

tr

P

(

!

)

C

(

!

)]

d!

(31)

The route taken here follows the one in 2].

4 Main Results

In this section we will consider experiment design problems where the goal is to minimize

J

with respect to the design variables, which we take as

K

and 

r

. Other equivalent choices are also possible, e.g., 

u

and 

ue

or

K

and 

ru

. To make the designs realistic we will also impose constraints on the input power or the output power, or both. We will also discuss the case where power of the reference signal is bounded. Without loss of generality we will take the upper bounds to be unity. This is done to simplify the notation and can always be achieved by suitable scaling. In the sequel we also will omit the arguments

!

and

ei!

whenever there is no risk of confusion.

First consider the case where penalty only is put on the mist in

G

. This corresponds to

C

12

=

C22

= 0 . We then have the following result which to our knowledge is new.

Theorem 1. The solution to the problem min

K r

Z



;

tr

PC

]

d!

:

C12

= 0

 C22

= 0 

Z 

;

f



u

+ (1

;

)

ygd!

1

 2

0



1]



(32) is

K

opt

= arg min

K Z



;

jKj

2

+ (1

;

)

j

1 +

G0Kj2



vd!

(33)



optr

=

p



vC11 j

1 +

G0Koptj2

p

+ (1

;

)

jG0j2

(34) where

is a constant, adjusted so that

Z



;

f



u

+ (1

;

)

ygd!

= 1 (35)

(7)

Remark: It should be noted that equation (33), dening the optimal controller

Kopt

, is a standard LQ problem. If

G0

and 

v

were known this could be easily solved to give an explicit expression for

Kopt

.

Proof. Replace the design variables

K

and 

r

by the equivalent pair

K

and 

ru

. Then, by using (15), (11), and (17), problem (32) can be rewritten as

min

K 

r

u

Z



;



v



ruC11d!

:

Z 

;

f

(

+(1

;

)

jG0j2

)

ru

+

jKj2

+ (1

;

)

j

1 +

G0Kj2



vgd!

1

 2

0



1]



(36) The criterion function is independent of

K

hence the optimal controller

Kopt

can be found by solving the LQ problem

min

K Z



;

jKj

2

+ (1

;

)

j

1 +

G0Kj2



vd!

(37)

(Here it is implicitly assumed that

y

(

t

) =

G0

(

q

)

u

(

t

)+

v

(

t

)

 u

(

t

) =

;K

(

q

)

y

(

t

), and

2

0



1].) This proves (33). Dene the constant

as

= 1

;

Z



;

jK opt

j

2

+ (1

;

)

j

1 +

G0Koptj2



vd!

(38) Problem (36) now reads

min

 r

u f

Z



;



v



ruC11d!

:

Z



;

(

+ (1

;

)

jG0j2

)

rud! g

(39) This problem has the solution (cf. 7], p. 376)



ru

=

s



vC11

(

+ (1

;

)

jG0j2

) (40)

where

is a constant, adjusted so that

Z



;

((

+ (1

;

)

jG0j2

)

rud!

=

(41) or in other words so that

Z



;

f



u

+ (1

;

)

ygd!

= 1 (42) Consequently the optimal 

r

is



optr

=

p



vC11 j

1 +

G0Koptj2

p

+ (1

;

)

jG0j2

(43)

which ends the proof.

(8)

We stress that the optimal controller

Kopt

in (33) can easily be found by solving the indicated discrete-time LQ problem (if

G0

and 

v

were known). Among other things this implies that the optimal controller

Kopt

is guaranteed to stabilize the closed-loop system and be linear, of the same order as

G0

. This is a clear advantage over the results reported in, e.g., 4] and also over the results in Theorem 3 below. Furthermore, the optimal controller is independent of

C11

which also is quite interesting and perhaps somewhat surprising. This means that whatever weighting

C11

is used in the design criterion, it is always optimal to use the LQ regulator (33) in the identication experiment.

From Theorem 1 we also see that closed-loop experiments are optimal as long as the is a constraint on the output power, i.e., as long as

6

= 1. If

= 1 then

Kopt

= 0 and the optimal input spectrum 

optu

(= 

optr

) becomes



optu

=

p



vC11

(44)

This side result can in fact be strengthened 7]:

Theorem 2. The problem min

K r

Z



;

tr

PC

]

d!

:

Z



;



ud!

1

 C12

= 0



(45) has the solution

K

opt

= 0 (open-loop operation) (46)



optr

= 

optu

=

p



vC11

(47)

where

is a constant, adjusted so that

Z



;



ud!

= 1 (48)

Consider now the case where the power of the reference signal is constrained. The following theorem, which we believe is new, then states what the optimal controller and reference spectrum are. The expression for the optimal controller also holds for the case where the reference signal is xed and pre-specied (which would be the case if data were collected under normal (controlled) operation of the plant).

Theorem 3. The problem min

K r

Z



;

tr

PC

]

d!

:

Z



;



rd!

1



(49) has the solution

K

opt

=

; C11G0

+

C12H0

C

11 jG

0 j

2

+ 2Re(

C12G0H0

) +

C22jH0j2

(50)



optr

=

s



v

det

CjH0j2

C

11 jG

0 j

2

+ 2Re(

C12G0H0

) +

C22jH0j2

(51) where

is a constant, adjusted so that

Z



;



rd!

= 1 (52)

(9)

Remark: Note that

Kopt

in (50) only is a feasible design if

Kopt

is proper and stabilizes the closed-loop system. Unfortunately no such guarantees can be given, which is a clear disadvantage compared to the result in Theorem 1. The optimal solutions derived in 4]

suer from the same type of problems, as mentioned earlier. To circumvent this problem it was suggested in 4] that the so called Youla parameterization 8] should be used. This idea could lead to interesting extensions of the results in Theorem 3 also, although we will not study it any further here.

Proof. Introduce the following square-root factorization of

C

:

C

=

C

0

1 CC23



C

1 C

0

C23





(53)

Clearly we have the relations

C

11

=

jC1j2

+

jC2j2

(54)

C

12

=

C2C3

(55)

C

22

=

jC3j2

(56)

With

P

given by (15) we have that

tr

PC

] =

C22jH0j2

+ # (57)

where

# =  

vru



C

11

+ 2Re(

C12 KS0H0

) +

C22jKj2jS0j2jH0j2

(58)

=  

vru



jC

1 j

2

+

jC2

+

C3KS0H0j2

(59)

=  

vr



jC

1

(1 +

KG0

)

j2

+

jC2

(1 +

KG0

) +

C3KH0j2

(60)

=  

vrk



C

1 C

2



+

KC1G0 C2G0

+

C3H0k2

(61) There is no constraint on

K

so to minimize the criterion function we should, at each frequency, choose

K

as the least-squares estimate

K

opt

=

;



C

1 C

2



C

1 G

0 C

2 G

0

+

C3H0

k



C

1 G

0 C

2 G

0

+

C3H0k2

(62)

=

;

(

jC1j2

+

jC2j2

)

G0

+

C2C3H0

(

jC1j2

+

jC2j2

)

jG0j2

+ 2Re(

C2C3G0H0

) +

jC3j2jH0j2

(63)

=

; C11G0

+

C12H0

C

11 jG

0 j

2

+ 2Re(

C12G0H0

) +

C22jH0j2

(64) With this choice of

K

it can be shown (by straightforward but tedious calculations) that

tr

PC

]



K=K

opt

=

C22jH0j2

+  

vr

det

CjH0j2

C

11 jG

0 j

2

+ 2Re(

C12G0H0

) +

C22jH0j2

(65)

(10)

The rst term is independent of 

r

, hence we get (cf. 7], p. 376)



optr

=

s



v

det

CjH0j2

C

11 jG

0 j

2

+ 2Re(

C12G0H0

) +

C22jH0j2

(66) where

is a constant, adjusted so that

Z



;



optr d!

= 1 This proves the theorem.

Note that if

C

is singular the optimal 

r

in (51) can be chosen at will, as long as

Z



;



optr d!

= 1 (67)

This is due to the factor det

C

in (51) which vanishes when

C

is singular. Furthermore, when

C

is singular the optimal

K

in (50) simplies to

K

opt

=

; C12

C

12 G

0

+

C22H0

(68)

We have thus established the following theorem which is a variant of a result given in 6]. See also 2].

Theorem 4. If

C

is singular then the problem min

K 

r

Z



;

tr

PC

]

d!

: any constraint



(69) has the solution

K

opt

=

; C12

C

12 G

0

+

C22H0

(70)



optr

arbitrary (71)

as long as

u

given by

u

(

t

) =

r

(

t

) +

C12

(

q

)

C

12

(

q

)

G0

(

q

) +

C22

(

q

)

H0

(

q

)

y

(

t

) (72) is an admissible input.

5 Examples

5.1 Internal Model Control

In Internal Model Control (IMC) the control law is chosen as

u

(

t

) =

K

(

q

)(

rc

(

t

)

;y

(

t

)) =

Q

(

q

)

1

;G

(

q

)

Q

(

q

)(

rc

(

t

)

;y

(

t

)) (73)

(11)

where

Q

is some stable transfer function and

G

a model of the true plant

G0

. Here

rc

denotes the reference signal to be used in the control application, not to be mixed up with the reference signal

r

used in the identication experiment. Note that the control design will not depend on the noise model. Also note that the designed closed-loop system is

G des

c

(

q

) =

G

(

q

)

K

(

q

)

1 +

G

(

q

)

K

(

q

) =

G

(

q

)

Q

(

q

) (74) For this 1-dof regulator

Gc

coincides with the complementary sensitivity function. From (74) we see that

Q

=

Gc=G

should be large where a raise in the system's gain is desired.

Ideally, that is when

G

=

G0

, the \best" or \true" IMC controller results:

K

0

(

q

) =

Q

(

q

)

1

;G0

(

q

)

Q

(

q

) (75)

In general the model

G

is obtained through identication using

N

data samples. In that case

G

= ^

GN

and the corresponding controller is

^

K

N

(

q

) =

Q

(

q

)

1

;G

^

N

(

q

)

Q

(

q

) (76)

Neglecting the response from the noise source

e

we obtain the \true" output

y0

as

y

0

(

t

) =

G0

(

q

)

K0

(

q

)

1 +

G0

(

q

)

K0

(

q

)

rc

(

t

) =

G0

(

q

)

Q

(

q

)

rc

(

t

) (77) The actual output resulting from use of the suboptimal controller (76) is

^

y

N

(

t

) =

G0

(

q

) ^

KN

(

q

)

1 +

G0

(

q

) ^

KN

(

q

)

rc

(

t

) =

G0

(

q

)

Q

(

q

)

1 + ~

GN

(

q

)

Q

(

q

)

rc

(

t

) (78) A measure of the performance degradation can be obtained by studying the error signal

~

y

N

=

y0;y

^

N

. We have

~

y

N

(

t

) =

G0

(

q

)

Q

(

q

) 1

;

1 1 + ~

GN

(

q

)

Q

(

q

)



r

c

(

t

) (79)

=

G

~

N

(

q

)

1 + ~

GN

(

q

)

Q

(

q

)

G0

(

q

)

Q2

(

q

)

rc

(

t

) (80) Using rst-order approximations we get

~

y

N

(

t

)

G

~

N

(

q

)

G0

(

q

)

Q2

(

q

)

rc

(

t

) (81) and it follows that the resulting weighting matrix

C

in this case is given by

C

IMC

=

jG0j2jQj4



rc

0

0 0



(82) Suppose now that we want to solve the problem

min

K r

E

"

y

~

2N

(

t

) :

Z



;

f



u

+ 

ygd!<1



(83)

(12)

where

K

and

r

refers to the set-up (10) used in the identication. Theorem 1 then states that the optimal strategy is to use closed-loop experiments with

K

and 

r

chosen as

K

opt

= arg min

K Z



;

jKj 2

+ 1

j

1 +

G0Kj2



vd!

(84)



optr /p



v



rcjG0jjQj2j

1 +

G0Koptj2

p

1 +

jG0j2

(85)

A remark on this choice of reference spectrum is that (for xed

G0

and 

v

) 

optr

will be large where either of 

rc

or

jQjjGdesc =G0j

, is large.

5.2 Generalized Minimum Variance Control

In Generalized Minimum Variance Control (GMVC) the goal is to make the closed-loop system behave like

y

0

(

t

) =

R

(

q

)

e

(

t

) (86) This can be accomplished by using a regulator of the form

u

(

t

) =

;H0

(

q

)

;R

(

q

)

G

0

(

q

)

R

(

q

)

y

(

t

) (87) It can be shown that the degradation signal ~

yN

can be written (neglecting errors of order higher than 2)

~

y

N

(

t

) =

G

~

N

(

q

) ~

HN

(

q

)



"

H

0

(q);R(q)

G

0 (q)

;

1

#

R

(

q

)

H

0

(

q

)

e

(

t

) (88) leading to a

C

-matrix of the form

C

GMVC

=

0







 R

H

0







 2

"

jH

0

;Rj 2

jG

0 j

2

; H0;R

G0

; H

0

;R

G

0

1

#

(89) Note that

CGMVC

is singular. Hence the choice

K opt

y

=

; ;

(

H0;R

)

=G0

;

(

H0;R

) +

H0

=

H0;R

G

0 R

(90) minimizes

"

Ey

~

N2

(

t

) (91)

subject to any constraint according to Theorem 4. In other words, the optimal controller is

the true GMVC controller. This was rst shown in 2].

References

Related documents

Starting with the data of a curve of singularity types, we use the Legen- dre transform to construct weak geodesic rays in the space of locally bounded metrics on an ample line bundle

46 Konkreta exempel skulle kunna vara främjandeinsatser för affärsänglar/affärsängelnätverk, skapa arenor där aktörer från utbuds- och efterfrågesidan kan mötas eller

Coad (2007) presenterar resultat som indikerar att små företag inom tillverkningsindustrin i Frankrike generellt kännetecknas av att tillväxten är negativt korrelerad över

Parallellmarknader innebär dock inte en drivkraft för en grön omställning Ökad andel direktförsäljning räddar många lokala producenter och kan tyckas utgöra en drivkraft

In conclusion, the material that was collected for the case study of http://www.dn.se conveys an understanding of the now that is both deeply rooted in the past and full of messages

However, protein intake expressed as z-scores, decreased over the 9 first days of life and protein intakes was significantly lower during days 5-7 after

The overall aim of this thesis was to explore energy and nutritional intake in infants, children and adolescents with complex CHD or hsPDA, as well as investigate growth

Industrial Emissions Directive, supplemented by horizontal legislation (e.g., Framework Directives on Waste and Water, Emissions Trading System, etc) and guidance on operating