Model Error Modeling and Control Design

(1)

LennartLjung

Department of Electrical Engineering

Linkping University, S-581 83Linkping, Sweden

WWW: http://www.control.isy.liu .se

Email: ljung@isy.liu.se

2000-03-07

REG

LERTEKNIK

AUTO

_{MATIC CONTR}

OL

LINKÖPING

Report no.: LiTH-ISY-R-2220

Forthe IFACSymposium on System Identication, SYSID2000,

SantaBarbara, CA, July 2000

Technicalreportsfrom theAutomatic Controlgroupin Linkpingare available

by anonymous ftp at the address ftp.control.isy.liu.se. This report is

(2)

DESIGN

Lennart Ljung

Divisionof AutomaticControl,LinkopingUniversity,

SE-58183, Linkoping, Sweden, email: ljung@isy.liu.se

Abstract: Model validation and estimating the size of a possible model error is a

central aspect of System Identication. In this contribution we discuss the model

errorconceptsandmodelerrormodelingforcontroldesign.Ofspecialinterestishow

to makeuse of periodic inputs, and how to deal with non-linear errormodels. The

discussionislimitedto SISOmodelsandstabilityrobustnessissues.

1. INTRODUCTION

Inthiscontributionweshallconsider the

estima-tionoflinearmodels,ofthetype

y(t)=G(q;)u(t)+H(q;)e(t) (1)

Thisisofcourseastandardtopicintheliterature,

e.g.,(Ljung 1999a). Theparameter vector will

be estimated using a prediction error method,

leading to an estimate ^

N

with corresponding

transferfunction estimate

^ G N (q)=G(q; ^ N ) (2)

Thisestimatewill thenbeused todesigna

regu-lator

u(t)=K(q)(r(t) y(t)) (3)

sothat theclosed loopsystem

y(t)= K(q) ^ G N (g) 1+K(q) ^ G N (q) (4) behaveswell.

Theproblemisthat ^

G

N

willonlybean

approxi-matedescriptionofthetruesystem.Theextensive

literature on robust control design, e.g., (Zhou

et al. 1996), (Skogestad and Postlethwaite1996)

deals with thequestion to ensure good behavior

oftheclosed loopsystem,despiteinaccuraciesin

themodel ^

G

N .

When the model has been identied from data,

the uncertainty of the model. The interplay

be-tweentheestimationandthedesignprocesseshas

receivedsubstantialattentionoverthepastyear,

e.g.,(SmithandDoyle1992),(Kosutetal.1992a),

(RanganandPoolla1996),(Skelton1989),(Gevers

1993),(Kosutet al.1992b),(SmithandDullerud

1996),(Poollaetal.1994).

Part of this problem is to estimate, in reliable

way, the model error or model uncertainty, not

assuming that the true system canbe described

withinthechosenstructure(1).See,e.g.,(Ninness

and Goodwin 1995), (Ljung 1999b) Then it is

mostlyassumedthatthetruesystemislinear,and

givenby atransferfunction G

0

(q),and thatthe

modelerroriscapturedbyestimatesofthesizeof

thefrequencyfunction

~ G N (e i! )= ^ G N (e i! ) G 0 (e i! ) (5)

In this contribution we shall consider the more

general situation of possibly non-linear model

errors,anddiscusstherolesthatanexplicitmodel

errormodelwillplayinsuchaset-up.

The discussion is conned to

single-input-single-output systems/models and to stability

robust-nessissuesonly.Thisis asimplicationanddoes

(3)

2. MODELVALIDATIONANDMODEL

ERRORMODELING

To validate a model is to confront it with facts

aboutthesystem.Anidealsituationisthatthere

is prior knowledge about the system which can

becompared withthe model's properties. Often,

however,theonly\facts"thatareathandarethe

datathemselves.Itisdesirablethatthedataused

forvalidation arenotthesameasforestimation,

a separate \validation data set" should be used.

It then boils dow to comparing what the model

thinkstheoutputshouldbewithwhattheactual

measured output turned out ot be. That is, the

residuals, the model left-overs, should be

exam-ined:

"(t)=y(t) y^

N

(t) (6)

Herey^isthesimulatedoutput

^ y N (t)= ^ G N (q)u(t) (7) ^ G N

isthenominal model, whichmaybetheone

estimated in (2), be may alsobechosenin other

ways.

Ifanoisemodel ^

Hhasbeenestimateditisnatural

toconsideralteredversion

" F (t)= ^ H 1 N (q)"(t) (8)

andsubjectthatto theanalysis below.

2.1 Classical Correlation AnalysisofResiduals

One of the most basictests, (Draper and Smith

1981),is to computethecorrelation betweenthe

regressors, in our case the past inputs, and the

residuals: ^ r N ()= 1 N N X t=1 u(t )"(t) (9)

Itis customaryto plottheseestimatesasa

func-tion of and compare with theirstandard

devi-ations to check if they are signicantly dierent

from zero.If not,wehavenot tracedany

signi-cantin uenceofuin",sowecannotsaythatthe

model ^

Ghasnot pickedupall thein uence ofu

ony.(Notethedoublenegation:wearenotsaying

that\ ^

Ghaspickedupall...").Itisconvenientto

form '(t)= u(t 1) ::: u(t M) T (10) h M N = 2 6 4 ^ r N (1) . . . ^ r N (M) 3 7 5 = 1 N N X t=1 '(t)"(t) (11)

Under theassumption that " iswhite noisewith

variance,hhasanormaldistributionwithzero

meanandvariance=NR

N ,where R N = 1 N N X t=1 '(t)' T (t) (12) M N = 1 N k N X t=1 '(t)"(t)k R 1 N (13)

will in this case have a 2

distribution, and the

familiar 2 -test M N < (14)

isbasedonthis.Theresultistypicallypresented

as a plot of the autocorrelation of the residuals

andaplotof (9).SeeFigure1.

Notethatotherkindofdependencescanbetested

quiteanalogouslybyletting'(t)beother,

nonlin-ear,functions ofpastinputs:

'(t)=f(u t

) (15)

2.2 CorrelationAnalysisasModelErrorModeling

0

5

10

15

20

25 −0.4

−0.2

0

0.2

0.4

0.6

0.8

1 Correlation function of residuals. Output # 1

lag

−25

−20

−15

−10

−5

0

5

10

15

20

25 −0.2

−0.1

0

0.1

0.2

0.3 Cross corr. function between input 1 and residuals from output 1

lag

Fig. 1. Traditional residual analysis: Auto- and

cross-correlation functions with uncertainty

regions.

Theinformationfromthecrosscorrelation

analy-sisbetween"andu,canalsobeinterpretedasan

implicitFIRmodel forthetransferfunction ~

Gin

"(t)= ~

G(q)u(t)+w(t) (16)

from u to ". For control purposes, it is much

more eective to present the (amplitude)

fre-quency function of this model error model, with

uncertaintybounds asinFigures 2{4.Thedata

usedin theseguresaresimulatedfromasecond

orderARMAXmodel.Itisclearthatconventional

model validation corresponds to increasing the

modelcomplexityuntilthemodelerrormodelhas

uncertaintyboundsthatincludezero(asinFigure

4), since then there is no clear evidence that ~

G

is not zero { the estimated model is then not

falsied. But it is also clear that the two plots

together;themodelandits\sidekick",themodel

errormodel,canbeusedforcontroldesign,evenif

themodelisfalsied.LookatFigure3.According

(4)

well beused forcontrol designiftheinformation

inthelowerplotistakenintoaccount.

10 −1

10

0

10

1

10

2

10 −4

10 −2

10

0

10

2

10 −1

10

0

10

1

10

2

10 −4

10 −2

10

0

10

2

Fig. 2. Upper plot: Amplitude Bode plot of a

rstordermodelwithestimateduncertainty

bounds. The true system is also plotted.

Lowerplot:Themodelerrormodelcomputed

asa20:thorderARXmodelfromutoy ^ Gu

10 −1

10

0

10

1

10

2

10 −4

10 −2

10

0

10

2

10 −1

10

0

10

1

10

2

10 −4

10 −2

10

0

10

2

Fig. 3. As in previous gure, but second order

ARXmodel

10 −1

10

0

10

1

10

2

10 −4

10 −2

10

0

10

2

10 −1

10

0

10

1

10

2

10 −4

10 −2

10

0

10

2

Fig. 4. As in previous gure, but second order

ARMAXmodel.Herethemodelerrormodel

containszeroinitsuncertaintyregion,which

meansthatthetopmodelisnotfalsied.

GAINS

3.1 TheGeneralForm

A model error model is a description of how u

aects ".We willtypicallynotbeinterestedin a

detailed suchdescription{that shouldrather be

doneasamoredetailednominalmodel.Primarily

wefocusonbounds onthegainof such amodel.

To go well together with linear control design,

we shall work with combinations of frequency

weightings and unstructured, unknown models

withbounded(estimated)gain.Thatmeansthat

thegeneralstructureofthemodelfromuto"can

bedepictedasinFigure5.Formallywehave

w " ~ g mem u W 1 W 2

Fig.5.Themodelerrormodel

" F (t)=W 1 2 (q)"(t); u F (t)=W 1 (q)u(t) (17) " F (t)=g~ mem (u t 1 F )+w(t) (18) Here, W 1 and W 2

are given linear lters,

super-scriptt 1denotesallthesignalvaluesuptotime

t 1,andg~

mem

is anuncertainmodel. The

bot-tomlineisthat,withsomeamountofcondence,

basedonthemeasureddata,andpossiblyonprior

information,weshouldbeabletosaythatitsgain

isboundedby1: k~g mem k 1 <1 (19)

Normalization can always be achieved using W

1

andW

2

.Weshallshortlydiscusshowtoestimate

suchabound.

Themodel~g

mem

cancome indierentshapes.It

couldbe

Linear Time Invariant. In this case we can

take either W

1 orW

2

to be unity, since all

theblocksinin Figure5commute.

LinearTimeVarying.

GeneralNon-Linear.

3.2 Estimating the Gain of the Model Error

Model: DistinguishingModel Errorfrom Noise

TheestimationtaskistodeterminetheltersW

i

andestimatethegainofg~

mem in " F (t)=g~ mem (u t 1 F )+w(t) i.e.estimate sup t sup t k~g mem (u t )k ku t k

(5)

to determine this gainwithout prior knowledge.

Even in the linear case, an arbitrarily thin

res-onance peak will require an arbitrarily long

se-quence of data, due to theuncertaintyprinciple.

After having considered a nite data record, no

guarantee canbe given for the \true"gain. The

estimate of the gain will always re ect the data

thathavebeenmeasured,possiblyinconjunction

withpriorknowledge.

Arstfundamental problemistodistinguishthe

model error ~g

mem

from the noise contribution

w. Loosely speaking, the noise w would be that

contribution to " that would not change if u is

changed.Thisproblemisaddressedin(Smithand

Dullerud 1996) by looking at trade-os between

modelerrorsizeandhardamplitudeboundsonw.

From astatistical perspective,it ismorenatural

totryand\correlateout"w.Therearetwobasic

methodsforthat:

Useaperiodicinput,andeliminate/reducew

byaveragingovertheperiods.

Build a(parametric) model of ~g

mem

in (18)

assuminguandwtobeindependent.

3.3 UsingPeriodicInputs

Suppose the noise contribution is additive as in

(18)andthat uisperiodicwith periodP. Then,

afteratransienthasdiedout,alsoy^

N

(t)(see(7))

will be periodicwith thesameperiod. Moreover,

thepartof y that originatesfrom theinputwill,

under weak assumptions, also be periodic with

periodP.Letydenotetheoutputoveroneperiod,

obtained by averaging over all the periods, and

similarlyfor" F (t): " F (t)=W 1 2 (q)(y(t) y^ N (t)); t=1;:::;P

From(18)wenowhave

" F (t)=~g mem (u t 1 F )+w(t)

where w will tend to zero asN tends to innity

(moreandmoreperiodsareaveragedover).

A simpleunder-bounding estimateof thegainof

~ g mem is \ k~g mem k 2 = P P t=1 " 2 F (t) P P t=1 u 2 F (t) (20)

A natural choice of lter is W

1 (q) =

^

G (q) (the

nominalmodel).Thismakes

u F (t)=y^ N (t) (21) Moreover,W 2 canbechosenas W 2 (q)= ~ H(q),a

\noisemodel"fory(t) y^

N (t), y(t) y^ N (t)= ~ H(q)(t) (22)

with (t) beingwhite noise. That will makethe

numeratorin(20)assmall aspossible.Notethat

thismakes \ k~g mem k 2 = P P t=1 [W 1 2 (q)(y(t) y^ N (t))] 2 P P t=1 ^ y 2 N (t) (23)

a measure of the proportion of \unexplained"

output variation.Thishasacloserelationshipto

themultiplecorrelationcoeÆcientusedinresidual

analysis,page33in (DraperandSmith1981).

3.4 GeneralLinearBlackBoxModelErrorModels

The model error model concept gives us more

freedom in investigating the residuals than the

classical residual correlation test, which

essen-tiallyemploysaFIRmodel.Amoregenerallinear

model "(t)= ~ G(q)u(t)+ ~ H(q)w(t) (24)

could improve the estimate of the error model,

sinceamodelofthedisturbanceisused.

Insteadofparametriclinearmodels,wemayapply

spectral analysis to try and extract any linear

in uence of uon ",(Kosut 1986),(Stenman and

Tjarnstrom2000).Inanycase,thereisaclose

re-lationship between the Blackman-Tukeyspectral

analysisestimateofthistransferfunctionandthe

oneobtainedbyaFIR-model.

Once we have a specic, linear model of the

in uence from uto ", along with an uncertainty

(condence) region, the error model in Figure 5

is directly dened:Take W

2 =1 and W 1 as ~ G N +

. Here is the estimated variance error (at a

certain condence level), so that the amplitude

curve of W

1

corresponds to the upper curves in

(thelowerplotsof)Figures2-4.Ifnononlinearity

hasbeendetectedin theerrormodel,this choice

ofW

i

inFigure5givesacorrectdescriptionofthe

erroranduncertaintyassociatedwiththenominal

model.

3.5 Non-linear Model Error Models

Intheliterature,mostmodelerrordiscussionsas

well as the identication-for-control approaches

aredealtwithinasettingwhere\thetruesystem"

isahighorderlinearmodel,andthemodelsareof

lowerorder.In practicaluse,itis ofcourse more

common that the model errors are ignored

non-linearities rather than unmodeled linear

dynam-ics. From a model error perspective, this simply

meansthatweshould testnon-linearmodels:

" F (t)=g~ mem (u t 1 F )+w(t) (25)

(6)

neural network NNFIR model, cf (Sjoberg et al. 1995): " F (t)=~g mem (u F (t 1);:::;u F (t m);)+w(t) (26)

Thenumberoflaggedinputscanbechosen

rela-tivelysmall here,likem=5orso.Toappreciate

thesize (gain) ofanyestimated non-linearity(in

particularforcontrolapplications)itisnaturalto

usethesup-norm

k~g mem k 1 = sup u1;:::;um j~g mem (u 1 ;u 2 ;:::;u m ;)j^ 2 u 2 1 +:::+u 2 m (27)

Then alsodetermine theworstcase valueof this

gaininaproperlychosencondenceregionforthe

estimate:^ k~g mem k=sup ^ 2 k~g mem (u(t 1);:::;u(t m);)k^ 1 (28)

4. ROBUSTCONTROLDESIGN FOR

UNCERTAINSYSTEMS

4.1 LinearModelErrors

Once a model with its model error uncertainty

is delivered, the question is how to design a

controller that will stabilizethe systemrobustly.

Bythiswewouldmeanthatthechosencontroller

should stabilize allmodels inthe \region"dened

bythe nominalmodelandthe model error model.

Incasewehaveusedalinearmodelerrormodel,

this region is easily depicted in the frequency

domain. It will look likea strip in the Bode, or

Nyquistplot.SeeFigures2-4,i.e.

G2G=fGj jG(e i! ) G nom (e i! )j<(!)g (29)

How to achieverobust stability forsuch aset of

modelsiswellknown:ChoosearegulatorK,such

thatthecomplementarysensitivityfunction

T = G nom K 1+G nom K (30)

islessthantheinverserelativemodelerrorbound:

jT(e i! )j< G nom (e i! ) (!) ; 8! (31) H 1

techniques canbeused to determine ifsuch

aKexists,forgivenG

nom

and.See,e.g.(Zhou

etal.1996).

4.2 Frequency Weighted Non-linearModel Error

Model

The error model (18) corresponds to a closed

loop block diagram as in Figure 6. This can be

y ~ g mem G nom u K W 1 W 2

Fig. 6. Block diagram of the feedback loop with

model error

rearranged to be seen as feedback between the

non-linear part of the error model ~g

mem and KW 1 W 2 =(1+KG nom

)(keepingin mindthat we

onlyconsiderSISOmodelshere).Supposethatwe

normalizeW

1 W

2

sothatthegainofthenon-linear

partissubjectto k~g mem k 1 <1 (32)

The small gaintheorem tellsus that stability is

assuredif W 1 (e i! )W 2 (e i! )K(e i! ) 1+K(e i! )G nom (e i! ) <1 8! (33)

Comparingwith (31)werealizethat we just can

considerthesetofpossiblesystemdescriptionsto

belinearandgivenby

G2G=fGj jG(e i! ) G nom (e i! )j<::: (34) <W 1 (e i! )W 2 (e i! )g

By stabilizing any linear model in this set, i.e.,

achieving(31)for=W

1 W

2

,wehavealsomade

thelinearcontroldesignrobustagainstnon-linear

model errorsof the type (18).(Considering only

stabilityrobustness.)

5. ESTIMATINGNON-LINEAR FREQUENCY

WEIGHTEDERRORMODELS

Whendealingwithnon-linearmodelerrormodels

forcontroldesign,theoptionoffrequency

weight-ing isessential.As anexample,supposethetrue

systemis ~ y(t)= 1 (q 1)(q ) u(t) (35) y(t)=f(~y(t)) (36)

forsomestatic,linearlybounded, nonlinearityf.

Suppose,forsimplicity,thatthenominalmodelis

G

nom

(q) = 1=(q 1)(q ) so the model error

isthisnominalsystemfollowedbythestatic

non-linearityf(y) 1.Suppose that kf(y) 1k<1.

Thegainofthismodelerrormodelisinnite,due

totheinnitegainofthelinearpart.So,thiserror

modeldoesnotdomuchgoodforcontroldesign.

IfwenowpicktheweightingsW

1

(q)=1=(1 q),

(7)

weightingW

2

in (33)is no majorproblem, since

thedenominatoralsohasinnitegainatzero.

NotethatitisonlytheproductW

1 W

2

thataects

therobustnesstest,whilethesplitintoW

1 andW

2

mayhavea substantial in uence onthe norm of

~ g

mem

.Thisgivesanextradimensiontothemodel

error modeling. In loose terms the weightings

should be chosen so as to minimize k~g

mem k 1 whileletting W 1 W 2

haveashapethat suits(33).

A morecomprehensive discussionof howto deal

with this freedom has to be deferred to another

occasion.Somehintsare, however,asfollows:

Using W

2 =

^

H (the nominal noise model)

reducesthe poweroftheoutput of the

non-linearblock as much aspossible, givingthe

possibilityofasmallergainofthenon-linear

part.

Using W

1 =

^

G (the nominal model), will

focustheerrormodelontherelativeerror.

6. CONCLUSIONS

Building linear models may be deceptive. Given

any data set,of arbitrary length, wecan always

build a linear model, that will pass standard

residual analysis validation tests. The reason is

that any (non-linear, time-varying) system from

whichquasistationaryinput-outputsequencesare

collected,hasalinearsecondorderequivalent.We

cannotdistinguishthetruesystemfromthislinear

model, using just secondorder tests.This means

that the standard linearidentication setup will

deliveramodelwithuncertainty

^

G

where the(variance)error will tendtozero as

moredata are processed. Subjecting the data to

themodel-error-modeltreatmentdescribedinthis

contributionwillgiveamodel

^

G; =W

1 W

2

where will tend to zero, only if non-linearity

testsshownosignofnon-linearities.Otherwise

will re ect aremaining uncertainty/model error.

However,ifwejust consider stabilityrobustness,

itwillbesafetotreatthesystemasalinearone,

withthegivenuncertainty, evenifnon-linearities

havebeendetected.

7. REFERENCES

Draper, N.R. and H. Smith (1981). Applied

Re-gressionAnalysis,2nded.. Wiley,NewYork.

Gevers,Michel (1993).Towards ajointdesign of

identicationandcontrol?.In:Essayson

con-trol:Perspectivesinthetheoryandits

appli-cations (H L Trentelman and J C Willems,

Kosut, R., M. K. Lau and S. P. Boyd (1992a).

Set-membership identication of systems

with parametric and nonparametric

uncer-tainty.IEEETrans.AutomaticControl

AC-37,929{941.

Kosut, R.L.(1986).Adaptivecalibration:An

ap-proach to uncertainty modeling and on-line

robust control design. In: Proc. 25th IEEE

Conference on Decision an Control. Vol. 1.

Athens,Greece.pp.455{461.

Kosut,R.L.,G.C.GoodwinandM.P.Polis(Eds)

(1992b). Special Issue on System

Identica-tionforRobustControlDesign,IEEETrans.

AutomaticControl,Vol37.

Ljung,L.(1999a).SystemIdentication-Theory

for the User. 2nd ed.. Prentice-Hall. Upper

SaddleRiver,N.J.

Ljung, Lennart (1999b). Model validation and

model errormodeling. In:The

Astrom

Sym-posiium on Control (B. Wittenmark and

A. Rantzer, Eds.). Studentlitteratur. Lund,

Sweden.pp.15{42.

Ninness, B. and G. C. Goodwin (1995).

Es-timation of model quality. Automatica

31(12),1771{1797.

Poolla,K.,P.P.Khargonekar,A.Tikku,J.Krause

and K.Nagpal (1994). A time-domain

ap-proachto model validation.IEEE Trans.on

AutomaticControl AC-39,951{059.

Rangan, S. and K. Poolla (1996). Time-domain

validation for sample-data uncertainty

mod-els. IEEE Trans. Automatic Control

AC-41,980{991.

Sjoberg, J., Q. Zhang, L. Ljung, A. Benveniste,

B. Delyon, P.Y. Glorennec, H. Hjalmarsson

andA. Juditsky(1995).Nonlinearblack-box

modeling in system identication: A unied

overview.Automatica31(12),1691{1724.

Skelton, R. E. (1989). Model error concepts in

controldesign.Int.J.Control49,1725{1753.

Skogestad, S.and I.Postlethwaite(1996).

Multi-variableFeedbackControl.JohnWiley.New

York.

Smith,R.andG.E.Dullerud(1996).

Continuous-time control model validation using nite

experimental data. IEEE Trans. Automatic

ControlAC-41,1094{1105.

Smith, R. and J.C. Doyle (1992).Model

valida-tion:aconnectionbetweenrobustcontroland

identication. IEEE Trans. Automatic

Con-trolAC-37,942{952.

Stenman,AndersandFredrikTjarnstrom(2000).

A nonparametric approach to model error

modeling. In: Preprints of the 12th IFAC

Symposium onSystemIdentication. To

ap-pear.

Zhou, K., J. C. Doyle and K. Glover (1996).

Robust and Optimal Control.Prentice-Hall.