Generalized Planning with Loops under Strong Fairness Constraints Giuseppe De Giacomo1 1 Dipartimento

Fabio Patrizi1

di Informatica e Sistemistica Sapienza Universit` a di Roma Rome, Italy [email protected] [email protected]

Sebastian Sardi˜ na2

2 School

of Computer Science and IT RMIT University Melbourne, Australia [email protected]

KR 2010 - Toronto, May 12, 2010

1 / 16

Nondeterministic Planning Domains Nondeterministic Planning Domain D: A finite set P of propositions –whose subsets are called states–, capturing all domain’s relevant features A finite set A of actions, to be executed in the domain A transition relation −→⊆ 2P × A × 2P , representing the (possibly non-deterministic) domain dynamics, subject to action executions

Example (Coin Tossing) P = {head, tail, holding }, A = {grab, toss, turn} grab

toss

toss

turn

ρ = {∅ −→ {holding }, {holding } −→ {head}, {holding } −→ {tail}, {head} −→ turn

grab

grab

{tail}, {tail} −→ {head}, {tail} −→ {holding }, {head} −→ {holding }}

2 / 16

Conditional Planning Problems Conditional Planning Under Full Observability (For now, w/o loops) INPUT: I I I

a nondeterministic domain D = hP, A, ρi an initial state S0 ⊆ P a propositional goal formula γ over P

SOLUTION: a conditional plan π s.t. all executions achieve γ COMPLEXITY: EXPTIME-complete (also w/loops)

Example hS0 = ∅, γ = headi on Coin Tossing solved by plan: 1 grab 2

toss

3

if(¬head) then turn

grab ∅

{holding }

toss toss

{head}

turn

{tail}

3 / 16

Conditional Planning with Loops Loops allowed in plans

Example π = while (¬head) {grab; toss} {tail} toss grab



grab

{holding }

toss

{head}

Clearly, π does not solve hS0 = ∅, γ = headi... ...however, in the real world, everyone would bet it eventually does! We want to assert non-local constraints! 4 / 16

Conditional Planning with Loops (2) Previous work on Strong Cyclic Planning [CPRT03] assumes fair action executions: I I

I

All action effects eventually occur Cannot distinguish between fair and unfair action executions (either all or none!) Thus, cannot make decisions based on this

In this work We explicitly characterize relevant runs, through constraints Capture two different flavours of nondeterminism: I

I

Uncertainty: the effect will occurr, but don’t know exactly when (e.g., rolling a die) Partial Knowledge: may or may not occur (e.g.: picking cards from a deck with a possibly missing Ace)

5 / 16

Constraints on Runs Runs: possible evolutions of a domain, generated by executing plans

Example (π executions) grab ∅ grab ∅

{holding } {holding }

toss toss

grab

{tail} {head}

{holding }

nop

toss nop

{head}

{head} {head}

nop nop

{head} {head}

nop nop

{head} {head}

nop nop

··· ···

... grab ∅

{holding }

toss

{tail}

grab

{holding }

toss

{tail}

grab

{holding }

toss

Constraints: LTL formulas built from propositions in P ∪

{tail}

S

grab

···

a∈A {(act

= a)}

Example (Constraints on D runs) Tossing a coin infinitely often yields head to occur infinitely often ♦(act = toss) → ♦(head)

6 / 16

Constraints on Runs (2) Constraints: LTL formulas to be evaluated on domain runs We use run constraints to rule out irrelevant runs Only runs satisfying all constraints are significant

Semantics of constraints on domain runs Given: a planning domain D with a finite set C of constraints on runs initial state S0 and a goal formula γ A conditional plan π with loops achieves γ (π |= >Uγ) if all of its executions satisfying all C constraints reach a state S s.t. S |= γ LTL is very natural: Conditional Planning focuses on single executions (run-by-run) 7 / 16

Constraints on Runs (3) We use run constraints to assert non-local domain properties

Example (More realistic Coin Tossing domain) If we assert the following constraints on Coin Tossing: ♦(act = toss) → ♦(head) ♦(act = toss) → ♦(tail) Then plan π = while (¬head) {grab; toss} solves hS0 = ∅, γ = headi Indeed, the only unsuccessful π execution grab ∅

{holding }

toss

{tail}

grab

{holding }

toss

{tail}

grab

{holding }

toss

{tail}

grab

···

is discarded by first constraint above 8 / 16

Strong Fairness Constraints Strong Fairness If something φs happens infinitely often, then something else φe happens infinitely often ♦φs −→ ♦φe (φs and φe essentially propositional, (Next) allowed)

Strong Fairness captures also: I I

Weak fairness (something φ happens infinitely often): ♦> −→ ♦φ Persistence (something φ holds from a point on): ♦¬φ −→ ♦⊥

Not expressible in CTL [CGP99] Can phrase typical properties of our interest Good computational behavior (wrt Conditional Planning, see below) Existing results on LTL synthesis [PPS06, KPP05] apply 9 / 16

Synthesis in LTL LTL Synthesis Problem: X¯ 0 , Y¯

System ΦS (X , Y) X¯ , Y¯

X Y

Controller ΦC (X , Y) X¯ 0 , Y¯ 0

Uncontrolled (X = {x1 , . . . , xn }) and controlled (Y = {y1 , . . . , ym }) vars System assigns X vars (moves first), Controller assigns Y vars Both have their own structural assumptions (constraints on execution) Objective: Find a controller strategy satisfying an LTL specification ϕ (Technically, ϕ combines ΦC , ΦS , and desired requirements. In particular, ϕ requires the strategy to meet ΦC when interacting with ΦS ) 10 / 16

Synthesis in LTL (2) Complexity: For arbitrary ϕ, the problem is 2EXPTIME-complete [PR89] GR(1) formulas yield an EXPTIME bound [PPS06, KPP05] Generalized Reactivity (1) Specifications: ϕ = ϕa −→ ϕr ϕa : System structural assumptions + possible (weak) fairness constraints ϕr : Controller structural assumptions + possible (weak) fairness constraints Express (desired) requirements of the form ^ ^ ♦φi −→ ♦ψj n

m

Expressive enough for our problem! 11 / 16

Conditional Planning w/ Loops under SFC as LTL Synthesis ϕ = ϕa −→ ϕr In our case: 1 ϕ = ϕinit ∧ ϕtrans ∧ ϕrc , where: a a a a I I I

2

ϕinit a : initial condition (D state) ϕtrans : D transitions and goal achievement a ϕrc a : constraints on D runs (phrased as weak fairness)

ϕr = ϕtrans ∧ ϕgoal , where: r r I I

ϕtrans : one executable action at each point (plan structure) r ϕgoal : achieve desired goal γ (phrased as weak fairness) r

12 / 16

Main Results Theorem (Soundness & Completeness) Conditional Planning w/ Loops under Strong Fairness Constraints can be reduced to LTL synthesis for GR(1) formulas

Theorem (Complexity) Building a conditional plan with loops under strong fairness constraints is EXPTIME-complete Same complexity class as Conditional Planning w/ Full Observability!

Implementation Actual system available: TLV Based on (global) Model Checking techniques

13 / 16

Other Results (See paper)

In general, approach shown effective for: 1

Goals of the form ϕ = ψUφ (achieve φ while maintaining ψ)

2

Planning Programs [DPS10], whose atomic instructions are goals φUψ, can be captured and realized

3

Component-based Planning: actions offered by (finite-state) devices, possibly subject to strong fairness constraints

14 / 16

Conclusions and Future Directions

1

Conditional Planning w/ loops, with non-local constraints explicitly asserted

2

More general but not computationally harder than Conditional Planning w/ out Loops

3

Tackled as an LTL synthesis problem, actual system available

4

Suitable for extended scenarios (Planning Programs, Component-based Planning)

1

Performance evaluation to be carried out

2

Plan quality: e.g., avoid loops when not required

3

Planning-oriented techniques and heuristics

15 / 16

References Edmund M. Clarke, Orna Grumberg, and Doron A. Peled. Model checking. The MIT Press, Cambridge, MA, USA, 1999. A. Cimatti, M. Pistore, M. Roveri, and P. Traverso. Weak, strong, and strong cyclic planning via symbolic model checking. Artificial Intelligence Journal, 147(1-2):35–84, 2003. Giuseppe De Giacomo, Fabio Patrizi, and Sebastian Sardina. Agent programming via planning programs. In Proc. of AAMAS’10, Toronto, Canada, 2010. To appear. Yonit Kesten, Nir Piterman, and Amir Pnueli. Bridging the gap between fair simulation and trace inclusion. Information and Computation, 200(1):35 – 61, 2005. Nir Piterman, Amir Pnueli, and Yaniv Sa’ar. Synthesis of reactive(1) designs. In Proc. of VMCAI’06, volume 3855 of Lecture Notes in Computer Science (LNCS), pages 364–380. Springer, 2006. Amir Pnueli and Roni Rosner. On the Synthesis of a Reactive Module. In Proc. of POPL’89, pages 179–190, 1989.

16 / 16

Generalized Planning with Loops under Strong ...

Nondeterministic Planning Domain D: A finite set P of propositions –whose subsets are called states–, capturing all domain's relevant features. A finite set A of ...

367KB Sizes 0 Downloads 27 Views

Recommend Documents

Generation Capacity Expansion Planning under ...
2014 IEEE Power & Energy Society General Meeting (IEEE-PES-GM 2014), National Harbour, USA, Jul. 27-31, 2014. ... of reliability constraints [4], renewable integration [5], [6], emissions control ... 2-stage Stochastic Mixed-Integer Programming (SMIP

Cosmic Loops - PhilPapers
is locally irreflexive, asymmetry and transitive iff when we restrict the domain of entities quantified over to some domain D, then for all x in D, x does not ground x, ...

Cosmic Loops - PhilPapers
3 we could brutely stipulate grounding connections between different entities or ..... actually does in cases we are familiar with: I am part of the Milky Way Galaxy,.

Interpersonal Pathoplasticity in Individuals With Generalized Anxiety ...
2000), have biased social judgment regarding their negative impact on others (Erickson .... ology does not permit derivation of clusters that match the data best. Indeed, in Salzer et al. ..... behavioral treatment plus supportive listening (Newman e

Interference Channels with Strong Secrecy
Here uN is the lattice point chosen from Λ∩V(Λc), and dN is called the ..... k,1 +uN k,2 mod Λc,k = 1, ..., M. The shorthand d = ¯d to denote d. ¯N j = ¯d. ¯N.

Your Rights to Choices Counseling and Discharge Planning under the ...
and services to enable an individual to live in the community with support and services. ... complete list of funders, go to http://www.disabilityrightsca.org/.

Motion planning under bounded uncertainty using ...
Department of Electrical and Computer Engineering. University of Illinois at ... system, show that the position of this ensemble is controllable, and derive motion ..... apply the Lie algebra rank condition because the configuration space is no ...

Your Rights to Choices Counseling and Discharge Planning under the ...
administer an MDS assessment to all nursing facility (NF) residents. The MDS ... area by calling the California Department of Health Care Services at the number ...

Optimal Mobile Sensor Motion Planning Under ...
Keywords: Distributed parameter system, sensor trajectory, motion planning, RIOTS ... (Center for Self-Organizing and Intelligent Systems) at Utah State Univ. He obtained his ...... sults, in 'Proc. SPIE Defense and Security Symposium on Intelligent

PJ Stanbridge Planning in under-developed countries ...
irrigation was feasible, but they also drew ... control and operation” ... SABI-LIMPOPO AUTHORITY: IRRIGATION PROJECTS: S.E. LOWVELD OF RHODESIA. 30.

drum loops rock arty.pdf
Loops page 10 audio warez professional audio software. community. Rock/pop drumloops set 2 complextro music. Loopz best. drumloops! android apps on ...

Generalized relevance LVQ (GRLVQ) with correlation ...
Jan 10, 2006 - correlation in order to make its benefits for data processing available in compact prototype ... as the result of a self-organizing map (SOM), provides first hints about ...... MDS visualization technique, their expressiveness can be.

Optimality of designs with generalized group divisible ...
be arbitrary. Applying Lemma 2.1(1) and comparing to zd?1 shows that if d is to be .... resulting in simpler conditions that apply to the broader class of GGDDs including bi- nary and ... the key tools in MV-optimality arguments, will be needed.

PDF Generalized Linear Models with Random Effects
Data Analysis Using Hierarchical Generalized Linear Models with R · Hands-On Machine Learning with Scikit-Learn and TensorFlow · Generalized Additive ...

Generalized relevance LVQ (GRLVQ) with correlation ...
dInstitute of Computer Science, Technical University of Clausthal, Germany. Available online 10 January 2006. Abstract .... degrees of freedom of the cost minimization are the prototype locations in the weight space and a set of adaptive ...

Dynamic Facility Location with Generalized Modular ...
is necessary to ensure a fair representation of the cost structure found in practice. The costs to ...... A Dual-Based Procedure for Dynamic Facility Location.

The Generalized Random Priority Mechanism with ...
Jan 12, 2016 - finite-market mechanism that keeps feasibility, ex post individual rational- ity, and ex ..... 1. We see more formal arguments in subsequent sections, with the general model presented in the next section. 3 The Model. 3.1 Finite Market

New Insights on Generalized Nash Games with Shared Constraints ...
New Insights on Generalized Nash Games with Shared Constraints: Constrained and Variational Equilibria. Ankur A. Kulkarni. Uday V. Shanbhag. Abstract—We .... in the community of operations research and computational game theory it has been a common