• No results found

These schemes iteratively perform plant model identication and model-based controller update in the closed loop

N/A
N/A
Protected

Academic year: 2021

Share "These schemes iteratively perform plant model identication and model-based controller update in the closed loop"

Copied!
5
0
0

Loading.... (view fulltext now)

Full text

(1)

ITERATIVE IDENTIFICATION AND CONTROL

DESIGN SCHEMES

1

Hakan Hjalmarsson#, Svante Gunnarsson# and Michel Gevers$

#Dept. of Electrical Engineering, Linkoping University, S-581 83 Linkoping, Sweden

$CESAME, Louvain University, B-1348 Louvain-la-Neuve, Belgium Abstract

We demonstrate that some recently proposed itera- tive identication and control design schemes do not necessarily converge to a local minimum of the design objective in the case of a restricted complexity model.

There is, however, a link between these approaches and a recently proposed iterative optimization based control design procedure based on experimental data.

We show that if the achieved and the desired output responses are perfectly matched, the schemes are (es- sentially) equivalent under noise free conditions.

1. Introduction

Recently, so called iterative identication and control design schemes have received considerable attention.

See e.g. 1], 2] and 3]. These schemes iteratively perform plant model identication and model-based controller update in the closed loop. The work in 4]

and 5] is a continuation of these ideas, and there it is shown that for certain control criteria,e.g. LQG, it is possible to carry out the optimization using measure- ments from the plant collected during (essentially) normal operating conditions. No models of the plant and the disturbance are required. In this contribu- tion we show that there is a close relation between the optimization based tuning algorithm in 4] and the indirect schemes proposed for example in 6] and

1]. When Gauss-Newton steps are used, the opti- mization procedure can be approximately expressed as an identication criterion. These two identication criteria become identical only if the achieved and de- signed closed loops perfectly match each other, i.e.

only if the true system has been perfectly identied.

We show that when the model is too simple, the indi-

1This paper presents research results of the Belgian Pro- gramme on Interuniversity Poles of Attraction, initiated by the Belgian State, Prime Minister's Oce for Science, Tech- nology and Culture. The scientic responsibility rests with its authors.

rect schemes fail to get close to the minimum of the control criterion.

2. Criterion minimization

Let the true system be given by

y(t) =G0(q)u(t) +v(t) (1) where fv(t)g is a (process) disturbance. The out- put, fy(t)g, from the true system will be called the achieved response. We will use the following two de- grees of freedom controller:

u(t) =Cr(q)r(t);Cy(q)y(t) (2) wherefr(t)gis an external reference signal andrep- resents the controller parameters. To ease the nota- tion somewhat we will from now on omit the time argument of the signals. In addition, whenever sig- nals are obtained from the closed loop system with the controller fCr()Cy()g operating, we will in- dicate this by using the -argument thus, y() will denote the output of the system (1) in feedback with the controller (2). Let Td be a desired stable closed loop response from reference signal to output signal

y

d=Tdr: (3)

The error between the achieved and desired response

~

y() =y();yd is given by

~

y() = Cr()G0

1 +Cy()G0r;Tdr+ 1

1 +Cy()G0v: (4) It is natural to formulate the design objective as a minimization of some norm of ~y(), i.e.

 = argmin



J() = argmin



12E ~y2(): (5) Here E denotes expectation overvandrwhich we as- sume to be realizations of stationary stochastic pro- cesses. With T0() andS0() denoting the achieved

(2)

closed loop response and the sensitivity function with the controllerfCr()Cy()g, and given the statisti- cal independence ofrandv,J() can be written as

J() = 12Ehf(Td;T0())rg2+fS0()vg2i(6) The rst term is the tracking error, the second term is the contribution due to the disturbance. It is evident from (4) that J() depends in a fairly complicated way on . Furthermore, the true systemG0 and the spectrum offvgare unknown. The problem we would like to solve is to nd a solution forto the equation 0 =J0() = E~y()~y0()]: (7) This is done by taking repeated steps in a descent direction



i+1=i;iRi;1J0(i): (8) HereRi is some appropriate positive denite matrix, typically an estimate of the Hessian of J, such as a Gauss-Newton approximation of this Hessian. As stated this problem is intractable since it involves tak- ing expectation. It is, however, exactly a problem that can be attacked with stochastic approximation procedures, where one replaces J0 with an approxi- mation based on the current samples. In order to do this, the signal ~y(i) its gradient ~y0(i) is required. If a model of the plant is available, then this model can be used to compute this quantities. However, in 4]

and 5] it is shown that ~y(i) can be computed exactly and that ~y0(i) can be computed approximately using experimental data from (essentially) normal operat- ing conditions only. No explicit model is needed.

3. Model based criterion minimization

Instead of going directly for the controller parame- ters, one may want to estimate a modelG() of the open loop as an intermediate step. This is essentially only a question of reparametrization of the controller as a function of model parameters:  = (). The idea of optimization remains the same: it is the con- trol performance criterionJ(()) of (5) that is min- imized and not an identication criterion. The gra- dients Cr0(()) and Cy0(()) are now with respect to the parametersof the model. To compute these can be quite complicated, c.f. pole placement and LQG designs which involve Diophantine and Riccati equations. In, for example the case of a pole place- ment design, the reference modelTdis (possibly) also a function of1:

T

d() = Cr()G()

1 +Cy()G() Sd() = 1 1 +Cy()G()

(9)

1This will be the case when some of the zeros of the system are preserved in the design.

so thatyd yd(). As pointed out in 6] it is possible to write

~

y() =y();yd() =Sd()(y();G()u()): (10) Even after this rewriting, the minimization of

E(~y())2] cannot be viewed as a frequency weighted identication problem because the frequency weight- ingSd() and the input and output signals all depend on the unknown parameter vector. However, an it- erative procedure, that can be interpreted as identi-

cation using closed loop signals followed by control design, is obtained by keeping  xed to i, say, in all places in this expression except for G(), i.e. at the ith identication iteration one could minimize a square norm of the following errors:

e(i) =Sd(i)(y(i);G()u(i)): (11) Here y(i) andu(i) are the data obtained from the true system with the ith controller (computed from

G(i)) operating on the system. The identication step now is



i+1= argmin



F(i) (12) whereF is the quadratic function

F(xz) = 12E e2(xz) (13) The new parameter value i+1 and the new model

G(i+1) is used to update the controller, the new con- troller is applied to the true plant, new data are col- lected and the procedure is repeated. This procedure is suggested in 6]. The schemes found in 1] and 2]

also employ the same iterative identication/control procedure. They use control design objectives that do not involve a prespecied reference modelTd. An

H

1 criterion of the closed loop transfer functions is used in 2], while Zang et.al. 1] consider an LQG criterion.

4. Comparing the minima of the objective functions

We now examine the possibility that iterative identi-

cation and control schemes based on the above idea can converge to the minimum ofJ(). A convergence point#of the procedure satises



#= argmin



F(#): (14) A necessary condition for this is

F

x(##) = 0: (15)

(3)

This should be compared with the necessary condi tion for a minimizing point ofJ() =F():

J

0( ) =Fx(  ) +Fz(  ) = 0: (16) Thus, unless F is chosen properly,  does not have to satisfy (15) in order to be a minimizing point for

J(). The following example gives some insight.

Example: Let

y(t) =G0(q)u(t) +H0(q)e(t)

where fegis a zero mean white noise sequence with unit variance and where

G

0(q) =1;bqaq;1;1 H0(q) = 1 +aq;1 jaj<1: For simplicity we shall assume that the noise model

H

0 is known, i.e. H = H0. Suppose now that the purpose is to construct a minimum variance controller

C

y() = H;1

G() (17)

When the true system is in the model set,i.e. there is a such thatG( ) =G0, it is easy to verify that

 satises (15). Thus it is possible for the iterative scheme to converge to the (non-restricted) optimal controller. However, the situation changes if we con- sider a restricted complexity model. Let the model structure now be

G() =q;1 H(q) =H0(q):

Then (17) givesCy =a=and the output of the closed loop system then becomes

y(t) = 11;;a q2q;1;2e(t)

where =a(1;b=). This gives the output variance

J() = Ey2(t)] = 1 +a4;2a2 2 1; 2

Minimizing this expression gives = b. Hence the optimal restricted complexity control law is

u =;a

b y(t)

which gives the optimal closed loop pole = 0.

Turning to the iterative identication and control de- sign schemes the expression (11), in this example, is

e(xz) = Sd(z)(y(z);G(x)u(z))

= 1 +1 +axzaqq;1;1y(z)

Hence, the partial derivative of (13) w.r.t. its rst argument becomes

F

x() = Ee()ex()] (18)

= a

(1; 2)

;

;a

2+ (1;a2) +a3 A convergence point  = # must satisfy (15), and hence has to satisfy

a 2

;(1;a2) ;a3= 0 (19) It is easy to show that the stable solution (j j<1) is

#= 1;a2;

p1;2a2+ 5a4

2a a6= 0

and # = 0 for a = 0. As a function of a, # is continuous in the interval;1<a<1. Furthermore

lim

a!;1;

#= 1 and lim

a!1+

#=;1: Thus, the identication based iterative scheme for this restricted complexity model situation has a sta- tionary point which corresponds to a closed loop pole which can be anywhere in the interval (;11), de- pending on the open loop poleawhereas the optimal closed loop pole is at the origin regardless of a.

5. Numerical comparison

Let us now consider the problem of nding a con- troller for a third order system. In the indirect ap- proach it will be identied using output error (OE) models of rst order. The control design will be done using pole placement, and the design variable in the controller design will be the closed loop band- width!B. The reference signal is white noise ltered through a low pass lter with bandwidth!B. The it- erative design schemes starts by identifying a model from open loop data. The model is then used in the design of a pole placement regulator which is used for collecting a new set of data from the system, now act- ing in closed loop. This procedure is repeated until convergence and the value of the criterion after the last iteration is used for comparison. In each iden- tication step the designed sensitivity function from the previous iteration is used as prelter. The results of the simulations can be found in Table 1 where the achieved costJ of (5) are shown. It can be seen that for low designed bandwidths the iterative identica- tion and control scheme performs very well. How- ever, as the bandwidth is increased, the performance deteriorates dramatically. A reason is that the opti- mal (reduced complexity) control laws for the higher bandwidths correspond to unstable models, while OE

(4)

models always are stable. Hence, it is in these cases impossible for these schemes to give the optimal con- troller. Unstable models can of course be obtained using the ARX model structure, but only minor im- provements are obtained using this structure.

!

B Iterative Optimization Iter.Id./Control

1 2:4 10;5 2:4 10;5

2 2:2 10;4 2:6 10;4

3 8:0 10;4 9:8 10;4

4 2:3 10;3 2:6 10;3

5 4:8 10;3 8:8 10;3

6 8:6 10;3 7:7 10;2

7 1:5 10;2 1:0 10;1

8 2:2 10;2 1:2 10;1

9 3:1 10;2 1:5 10;1

10 4:2 10;2 1:7 10;1

Table 1: Loss function using Gauss-Newton optimiza- tion and iterative identication and control

6. Approximating the Gauss Newton minimization step by an identication step

We will now show that each Gauss Newton iteration in the direct minimization procedure can be approx- imated by an identication step. For simplicity we consider only the disturbance rejection problem, i.e.

r0. The servo problem gives the same result but the technical details are a lot more involved. We use the following simple model reference scheme

C

y() = Sd;1;1

G() : (20) HereSd represents a desired sensitivity function and is therefore a xed (-independent) quantity. For de- sign methods such as LQG, Sd() is the result of a model based optimization procedure and the results below are only valid approximately. Recall that in the

H

2 iterative identication and control procedure, the identication step minimizes (w.r.t. ) the quadratic norm of (11),

e(i) =Sd(y(i);G()u(i)) (21) whereSdis now independent of, and remember that

~

y() = e(). The key to link the minimization of the quadratic norm of this signal to the optimization procedure is the following technical result, which is proven in 5].

Lemma: Let ~zi() be a rst order Taylor expansion of ~y() aroundi,

~

z

i() = ~y(i) + ~y0(i)(;i): (22)

Then the Gauss Newton updatei+1, withi 1, is the solution of the minimization problem



i+1= argmin



1

N N

X

1

~

z 2

i(t): (23) Let us now derive an expression for ~zi(). For the present case of disturbance rejection (i.e. r0), we have:

~

y() =Sd(y();G()u()) = 1

1 +G0Cy()v=y() withCy as given in (20). Simple manipulations then show that

~

y

0() =S0()(1;S0()) 1

G()G0()v (24) Next notice that

G

0(i)(;i)G();G(i) (25) for close toi. Thus,

~

y

0(i)(;i)S0(i)(1;S0(i)) 1

G(i)(G();G(i))v which, after some simplications, gives (26)

~

z

i()S0(i)



S

0(i) +G()(1;S0(i)) 1

G(i)



v:

We now compare this expression with that ofe((27)i) in (21), and we observe that, if the achieved and de- sired sensitivity functions coincide, i.e.

S

0(i) = 1

1 +Cy(i)G(i) = 1

1 +Cy(i)G0 =Sd then (27) gives

~

z

i()Sd(y(i);G()u(i)) =e(i): (28) In view of (23) and (28), it follows that identifying a modelG() by minimizingPN1(e(i))2withe(i) as in (21) is approximately equivalent to taking a Gauss-Newton step in the minimization of J() pro- vided the true sensitivity function S0(i) coincides with the designed Sd. We conclude that the least squares identication step used in the iterative H2 identication and control schemes approximates the Gauss Newton step in the direct minimization scheme only if the present closed loop model is very close to the true closed loop system. A question that now re- mains is whether the Gauss-Newton step can be ex- pressed as an identication step also in the case when the model does not coincide with the true system.

We will thus attempt to formulate the minimization

(5)

of P1 zi() as an identication problem no matter whatG(i) is. Denote the right hand side of (27) by

w

i(),

w

i() =S02(i)v+G()S0(i)(1;S0(i)) 1

G(i)v:

The signalsS02(i)v andS0(i)(1;S0(i))G(1i (29)

) v can be obtained from two closed loop experiments. In the

rst experiment, the reference signal is zero, since we are doing disturbance rejection. In the second exper- iment one should use the output signal from the rst experiment as reference signal. Let as usual super- scriptj denote the experiment number, then



y

i

def= y1i ;yi2=S02(i)vi1;S0(i)vi2 (30)



u

i

def= ; 1

G(i)yi2=;(1;S0(i))S0(i) 1

G(i)vi1

;

1

G(i)S0(i)v2i: (31) Thus, neglecting the disturbance in the second exper- iment, we have



y

i

; G()ui=S02(i)v1i

+ G()S0(i)(1;S0(i)) 1

G(i)vi1 (32)

; S

0(i)(G()

G(i);1)v2i wi()

and therefore, ~zi() yi;G()ui. This shows that the minimization of (23) can be interpreted as an identication problem. We observe that two exper- iments with dierent reference signals are needed.

Furthermore, we have neglected the inuence of the disturbance in the second experiment. Taking this term into account we have



y

i

;G()ui=wi() +G(i);G()

G(i) S0(i)v2i: Thus, if i+1 is chosen as the minimum of the quadratic norm of yi;G()ui, this new parameter will be biased towards the previous one,i, since the term

E

"



G(i);G()

G(i) S0(i)v2i

2

#

is minimized by  =i. New parameter values will tend to stick to old ones and this can cause the pro- cedure to converge, not to the desired local mini- mum of J(), but to some other point. Thus, we conclude that it is better to use an explicit Gauss- Newton step instead of the identication based pro- cedure presented in this section. With a direct Gauss- Newton step one also avoids having to solve an iden- tication step which itself requires an iterative min- imization procedure. Before we close this section let

us point out that for design methods other than (20) the derivations are only approximate. Several addi- tional terms are then involved in the expression (24).

7. Conclusions

We have compared two approaches to iterative con- troller design. It has been shown that the schemes proposed in 6] and 1] do not necessarily converge to a local minimum of the design criterion if the mod- eling error is non-zero. With the iterative optimiza- tion approach in 4] and 5], convergence to a local minimum does indeed take place under the assump- tion of boundedness of the signals in the loop. When the method from 4] is used in an indirect (model based) scheme, this approach becomes an iterative model update and control design procedure. With a Gauss Newton parameter update and a model refer- ence control design procedure, we have shown that the model update step can be approximated by a least squares identication step, but with a bias error due to a disturbance. This identication step dif- fers from the identication steps in e.g. 6] and 1], in the way that the least-squares criterion contains an additional term which is obtained from a second experiment. This term is the explanation why the optimization based method does converge regardless of the model error. This term vanishes when the achieved and desired loops are identical. Thus, under this condition the Gauss-Newton identication step becomes (essentially) identical to the corresponding identication steps in the algorithms of 6] and 1].

References

1] Z. Zang, R.R. Bitmead, and M. Gevers.H2iterative model renement and control robustness enhancement. In Proc. of the 30th Conference on Decision and Control, pages 279{284, Brighton, 1991.

2] R.J.P. Schrama. Approximate Identication and Control Design. PhD thesis, Delft University of Tech- nology, 1992.

3] W.S. Lee, B.D.O. Anderson, R.L. Kosut, and I.M.Y Mareels. A new approach to adaptive robust control. Int.

Journal of Adaptive Control and Signal Processing, 7:183{

211, 1993.

4] H. Hjalmarsson, S. Gunnarsson, and M. Gevers. A convergent iterative restricted complexity control design scheme. InProc. 33nd CDC, Orlando, Florida, 1994.

5] H. Hjalmarsson, S. Gunnarsson, and M. Gevers.

Model free data driven optimal tuning of controller pa- rameters. Technical report, Report LiTH-ISY-R-1680, Department of Electrical Engineering, Linkoping Univer- sity, Sweden, 1994.

6] K.J. Astrom. Matching criteria for control and identication. InProc. ECC, pages 248{251, Groningen, The Netherlands, 1993.

References

Related documents

I dag uppgår denna del av befolkningen till knappt 4 200 personer och år 2030 beräknas det finnas drygt 4 800 personer i Gällivare kommun som är 65 år eller äldre i

Detta projekt utvecklar policymixen för strategin Smart industri (Näringsdepartementet, 2016a). En av anledningarna till en stark avgränsning är att analysen bygger på djupa

18 http://www.cadth.ca/en/cadth.. efficiency of health technologies and conducts efficacy/technology assessments of new health products. CADTH responds to requests from

Energy issues are increasingly at the centre of the Brazilian policy agenda. Blessed with abundant energy resources of all sorts, the country is currently in a

Den här utvecklingen, att både Kina och Indien satsar för att öka antalet kliniska pröv- ningar kan potentiellt sett bidra till att minska antalet kliniska prövningar i Sverige.. Men

Av 2012 års danska handlingsplan för Indien framgår att det finns en ambition att även ingå ett samförståndsavtal avseende högre utbildning vilket skulle främja utbildnings-,

Det är detta som Tyskland så effektivt lyckats med genom högnivåmöten där samarbeten inom forskning och innovation leder till förbättrade möjligheter för tyska företag i

Industrial Emissions Directive, supplemented by horizontal legislation (e.g., Framework Directives on Waste and Water, Emissions Trading System, etc) and guidance on operating