Dynamic Abstraction for Interleaved Task Planning and Execution

(1)

LinkopingStudiesinS ien eandTe hnology

ThesisNo. 1363

Dynamic Abstraction for Interleaved Task Planning and Execution

by

Per Nyblom

SubmittedtoLinkopingInstituteofTe hnologyatLinkopingUniversityinpartial

fullmentoftherequirementsfordegreeofLi entiateofEngineering

DepartmentofComputerandInformationS ien e

Linkopinguniversitet

SE-58183Linkoping,Sweden

(2)

(3)

Dynamic Abstraction for Interleaved Task Planning and Execution

by

PerNyblom

April2008

ISBN978-91-7393-905-8

LinkopingStudiesinS ien eandTe hnology

ThesisNo. 1363

ISSN0280{7971

LiU{Tek{Li {2008:21

ABSTRACT

Itisoftenbene ialforanautonomousagentthatoperatesina omplexenvironmentto

makeuseofdierenttypesofmathemati almodelstokeeptra kofunobservableparts

oftheworld orto performpredi tion,planningandother typesof reasoning. Sin e a

modelisalwaysasimpli ationofsomethingelse,therealwaysexistsatradeobetween

themodel'sa ura y andfeasibilitywhen it isused within a ertain appli ationdue

to thelimitedavailable omputationalresour es. Currently,thistradeoisto alarge

extentbalan edbyhumansformodel onstru tioningeneralandforautonomousagents

in parti ular. This thesisinvestigates dierent solutionswheresu h agents aremore

responsibleforbalan ingthetradeoformodelsthemselvesinthe ontextofinterleaved

taskplanningandplanexe ution. Thene essary omponentsforanautonomousagent

thatperformsitsabstra tionsand onstru tsplanningmodelsdynami allyduringtask

planningandexe utionareinvestigatedandamethod alledDAREisdevelopedthatisa

templateforhandlingthepossiblesituationsthat ano ursu hastheriseofunsuitable

abstra tionsandneedfordynami onstru tionof abstra tionlevels. Implementations

ofDAREarepresentedintwo asestudieswherebothafullyandpartiallyobservable

sto hasti domain are used,motivated byresear h with UnmannedAir raftSystems.

The asestudiesalso demonstratepossibleways to performdynami abstra tion and

problemmodel onstru tioninpra ti e.

This workhasbeen supportedby theSwedishAeronauti sResear hCoun il (NFFP4-

S4203), the Swedish National Graduate S hool in Computer S ien e (CUGS), the

SwedishResear hCoun il(50405001)andtheWallenbergFoundation(WITASProje t).

DepartmentofComputerandInformationS ien e

Linkopinguniversitet

(4)

(5)

Acknowledgements

IwouldliketothankmyadvisorPatri kDohertywhohasgivenmemoreor

lessfreehandstoinvestigatethis fas inatingeldof Arti ialIntelligen e.

IthastrulybeensomeofthemostinterestingyearsofmylifeandIapologize

foralwayspi kingsubje tsthatyouarelessfamiliarwith.

During mytime at theArti ial Intelligen e andIntegratedComputer

Systemsdivision(AIICS),Ihavere eivedvaluableinputfrommanypeople.

Spe ial thanks to Martin Magnusson, Fredrik Heintz, Per-MagnusOlsson,

DavidLanden,PiotrRudolandGianpaoloContefor ommentingdraftsof

this thesis and related papersat various(perhaps dynami allygenerated)

levelsofabstra tion.

ThankstoMartinMagnussonforprovidingthereformanyinteresting

andsometimesendlessdis ussionswhi hreallymakemegrowasa person.

Alsothanksto PatrikHaslumforyourendlesswisdomand forsupporting

meduringmyearlydevelopment. ThankstoFredrikHeintzforyoursense

ofdetailandperfe tionandJonasKvarnstromforyourin redibleproblem

solving apabilities(andwilltosharethem). ThankstoTommyPersonand

BjornWingmanforyourhelpwith alltheimplementationissuesand your

insightsintotheUASTe hsystem.

Finally, I thankmy parentsKurtand Gunilla,my girlfriendAnna and

mydaughterAnneliforloveandsupport.

(6)

(7)

Introduction

ItisdiÆ ulttooverestimatetheimportan eofmathemati almodelsinour

modern so ietybe ause of their ommon use ine.g. naturals ien es and

engineering di iplines for many dierent purposes. Many types of mod-

els existstoday that an be usedfor a variety of tasks su h as predi ting

weather, simulating vehi le dynami s, monitoring nu lear power rea tors

andverifying omputerprograms.

ModelshavealsobeenusedwithintheareaofArti ialIntelligen e(AI)

to develop autonomous agents. It is widely onsidered that su h agents

shouldhavemodelsoftheirenvironments(andthemselves)tomakeitpos-

sibletooperate moresu essfully. Themodels anforexamplebeusedto

keeptra kofthe unobservablepartsof theworld,performpredi tion[34℄,

taskplanning[31℄andothertypesofreasoning[10℄.

1.1 Models and Tradeoffs

One ommon trait formathemati al models usedinpra ti al appli ations

isthatitisnotalwaysbene ial(orevenpossible)tomodeleveryaspe tof

asystemofstudydowntothesmallestdetailtogetasa urateaspossible.

Theproblemis thatthere isalwaysatradeo between a ura y andfea-

sibility ofamodelthatshouldbeusedfora ertainappli ationona given

ar hite ture. Theremightbeademandfortimelyresponseofasystemthat

prohibitslongdeliberationtime whi h inturn anmakeahighlydetailed,

but omputationallydemanding model inappropriatefor use in that par-

ti ular domain. Althoughthe omputational resour esthat anbe made

availablefordierentappli ationshavebeenin reasingexponentiallysin e

thedawn ofele troni omputers,therewillalwaysbealimitwhen apar-

ti ularsystemis beingdeveloped anddeployed. Thismeansthat onewill

alwayshave totrade a model'sa ura y forfeasibilityto get a reasonable

performan einany future system,whi hisa fa tthat isoftenmentioned

intheliteratureaboutpra ti almathemati almodelling[44℄[24℄.

(12)

1.2 Task Environments and Models

Whenamodelistobe onstru tedforanautonomousagent,itisimportant

to onsiderthetaskenvironment [63℄inwhi htheagentwilloperate.The

omplexity of the task environment an give signi ant hints about the

dierenttypesof models that anbeusedforwhateverthepurposeofthe

model is.

Ataskenvironment,whi h anbeeitherrealorsimulated,spe ies:

what theagent andototheenvironmentwith itsa tuators,

what informationit anre eivefromitssensors,

howtheenvironment works andwhat it ontains,and

what is onsidered \good or bad" with the help of a performan e

measure

A task environmentfor an autonomous ground robot an e.g. spe ify

thatthea tuators onsistofapropulsionsystemandpossiblyamanipulator

arm. Su hagentsarealsotypi allyequippedwithsensorssu haslaserrange

s anners, amerasand sometimes ollisionsensors. Theenvironmentmay

onsistoftables, hairs,walls,stairset .,anditsperforman emeasuremay

be dened in terms of power onsumption and the time to omplete an

assignedtask(su hasdeliveringapa kage).

Amodelthatanagentusesshouldbe losely onne tedtothetaskenvi-

ronmentthattheagentoperateswithin. Forexample,ifamodelisgoingto

beusedforpredi tingthestateofanautonomousagent'staskenvironment

depending on what a tions it performs, it better in lude spe i ations of

howthe a tuators,sensors andthesurrondingsworkinordertobeuseful.

Su hataskenvironmentmodel annotbetoodetailedduetothetradeo

betweena ura yandfeasibility.

A task environment or a model thereof an be lassied a ording to

some ommonly used dimensions [63℄ whi h to a large extent determine

howdiÆ ultitis tohandle.

Fully Observable

^or

Partially Observable

^: ^If^the ^agent's^sensors

an give a ess to all the relevant information in the environment

it is alled a fully observable task environment; otherwise the task

environmentis alledpartially observable.

Deterministic

^or

Stochastic

^: Îf^the^next^state îsômpletely^deter-

minedbythe urrentstateand thea tionexe utedbytheagentthe

taskenvironmentis alleddeterministi . Ifthereareseveralpossible

out omesofana tionitis alledasto hasti environment. Theterm

non-deterministi is often usedwhen out omes do nothave proba-

(13)

1.2. TASKENVIRONMENTSAND MODELS 3

Episodic

^or

Sequential

^: În ân êpisodi environment, the agent's urrent de ision does not in uen e the performan e of any future

episode. All environments onsidered in this thesis will be sequen-

tialwhi hmeansthattheagent's urrentde isionmightin uen ethe

performan eoftheagentinfuturestates.

Static

^or

Dynamic

^: ^A ^task environment whi h may hange while theagentdeliberatesis alledadynami environment;otherwiseitis

alledstati .

Discrete

^or

Continuous

^: ^A ^ontinuous environment ontains elements that are more a urately des ribed with ontinuous models

involving real values instead of an enumerable set of values. Task

environments that do not have any ontinuous elements are alled

dis rete.

Single Agent

^or

Multiagent

^: ^A ^taskenvironmentwhereotherex- ternalagents,besidesthemainagentitself,trytorea hgoalsormax-

imizetheir utilitiesare alled multiagent. Ifthe externalagentsare

betterdes ribedwithoutde ision apabilities,orifnoexternalagents

exist,theenvironment anbe onsideredsingle agent.

Inthis thesis, thesedimensionsareusedto lassifythe intrinsi prop-

ertiesofataskenvironment.Theyarenotassumptionsthate.g. adesigner

ofanagent anmake. Ontheotherhand,adesigner anmakeassumptions

thatarere e tedintheagent'staskenvironmentmodelsthatitissupposed

touse. Constru tedmodelsthatrepresentpartsofataskenvironmentmust

oftenbea simpli ationof therealthingand thedierentdimensionsare

then usedto lassify the model onstru tionassumptionsthat are notal-

ready a property of thetask environment. Thiswill bedis ussedmorein

Se tion1.4.

It is assumed that task environment models an be simulated. This

meansthatdierenta tions anbetestedwiththemodelwhi hmayresult

in one or several possible out omes depending on whether the model is

deterministi ornot. Sto hasti models anbesimulatedbypseudorandom

numbergenerators.

A task environment lass orenvironment lass isa set of taskenvi-

ronmentswithsimilarproperties. Anagentisoftendesignedto operatein

instan esofaparti ulartaskenvironment lasswheree.g. theenvironment

an ontainadierentnumberofobje tsand agentsbutmostoftheother

propertiesor assumptionsstay the same. In this thesis, the taskenviron-

ment instan esina parti ularenvironment lass are assumedto havethe

same lassi ation a ordingto the previously mentioned dimensionsand

thatthe a tuatorsand sensorsaresimilarlymodelled. Withina parti ular

environment lass,thetypesoftheobje tsintheenvironmentalsostaythe

samebutthenumberandinitial onditions mayvary intaskenvironment

Dynamic Abstraction for Interleaved Task Planning and Execution

Dynamic Abstraction for Interleaved Task Planning and Execution

Per Nyblom

Dynamic Abstraction for Interleaved Task Planning and Execution

Acknowledgements

Contents

1 Introduction 1

2 Preliminaries 16

3 Dynamic Decision Networks 28

4 The DARE Method 37

5 Case Study I 49

6 Case Study II 69

7 Conclusion 86

Chapter 1

Introduction

1.1 Models and Tradeoffs

1.2 Task Environments and Models

Fully Observable

Partially Observable

Deterministic

Stochastic

Episodic

Sequential

Static

Dynamic

Discrete

Continuous

Single Agent

Multiagent